Hi Andrés, hi Roannel,

that's correct, but the question was why the effective delay is "bigger" than the configured 2.5 sec.
Nutch implements the delay as sleeping time after one document has been fetched and before the next document is fetched. The observed 4-5 sec. therefore include the time spent fetching plus the delay.

If a "Crawl-delay" is specified in robots.txt, the configured delay is overridden by the value from robots.txt. Crawlers (or search engines) may differ in how they define "Crawl-delay", but at least some use it exactly in the sense Nutch does, cf.
https://yandex.com/support/webmaster/controlling-robot/robots-txt.xml#crawl-delay

Sebastian

On 11/23/2015 03:52 PM, Roannel Fernández Hernández wrote:
> Hi Andrés:
>
> The fetcher.server.delay property, as its description says, is the number of
> seconds the fetcher will delay between successive requests to the same
> server. So, if you configure the fetcher.server.delay property with 2.5 as
> value, Nutch will wait for 2.5 seconds to make another request to the same
> server and not between different servers.
>
> Regards.
>
> ----- Original message -----
>> From: "Andrés Rincón Pacheco" <[email protected]>
>> To: [email protected]
>> Sent: Friday, 20 November 2015 18:35:48
>> Subject: [MASSMAIL]fetcher.server.delay configuration not working
>>
>> Hi,
>>
>> I configured the fetcher.server.delay property with 2.5 as value, but when
>> nutch is fetching urls, the time between fetches is bigger than the
>> configured value.
>>
>> I attach some information from the execution.
>>
>> Date fetching             Difference in seconds
>> 2015-11-11 20:56:49,967   5
>> 2015-11-11 20:56:54,746   5
>> 2015-11-11 20:56:59,391   4
>> 2015-11-11 20:57:04,264   5
>> 2015-11-11 20:57:09,212   5
>> 2015-11-11 20:57:13,873   5
>> 2015-11-11 20:57:18,549   5
>>
>> and some lines of the log.
>>
>> 2015-11-11 20:56:21,674 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=359, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:22,674 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=359, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:23,675 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=359, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:24,676 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=359, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:25,677 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=359, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:25,981 INFO fetcher.Fetcher - fetching http://www.abcderer
>> (queue crawl delay=2500ms)
>> 2015-11-11 20:56:26,677 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=358, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:27,678 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=358, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:28,679 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=358, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:29,679 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=358, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:30,680 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=358, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:30,791 INFO fetcher.Fetcher - fetching http://www.abcderer
>> (queue crawl delay=2500ms)
>> 2015-11-11 20:56:31,681 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=357, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:32,681 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=357, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:33,682 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=357, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:34,683 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=357, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:35,684 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=357, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:35,960 INFO fetcher.Fetcher - fetching http://www.abcderer
>> (queue crawl delay=2500ms)
>> 2015-11-11 20:56:36,684 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=356, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:37,685 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=356, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:38,686 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=356, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:39,687 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=356, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:40,552 INFO fetcher.Fetcher - fetching http://www.abcderer
>> (queue crawl delay=2500ms)
>> 2015-11-11 20:56:40,688 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=355, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:41,689 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=355, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:42,690 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=355, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:43,691 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=355, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:44,691 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=355, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:45,128 INFO fetcher.Fetcher - fetching http://www.abcderer
>> (queue crawl delay=2500ms)
>> 2015-11-11 20:56:45,692 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=354, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:46,693 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=354, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:47,694 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=354, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:48,695 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=354, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:49,696 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=354, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:49,967 INFO fetcher.Fetcher - fetching http://www.abcderer
>> (queue crawl delay=2500ms)
>> 2015-11-11 20:56:50,697 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=353, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:51,698 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=49, fetchQueues.totalSize=353, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:52,699 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=353, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:53,700 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=353, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:54,701 INFO fetcher.Fetcher - -activeThreads=50,
>> spinWaiting=50, fetchQueues.totalSize=353, fetchQueues.getQueueCount=1
>> 2015-11-11 20:56:54,746 INFO fetcher.Fetcher - fetching http://www.abcderer
>> (queue crawl delay=2500ms)
>>
>> What is wrong in the configuration?
>>
>> Thanks.
>>
> November 13-14: 2015 Caribbean Finals of the ACM-ICPC Programming Contest
> https://icpc.baylor.edu/regionals/finder/cf-2015
>
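P.S. To make the arithmetic concrete, here is a rough sketch in plain Python (not Nutch code). It takes the timestamps of the "fetching" lines from the quoted log and subtracts the configured 2.5 sec. from each gap. The names CONFIGURED_DELAY, FETCH_STARTS and gaps_seconds are made up for illustration. Attributing the whole remainder to fetch time is an assumption: part of it may be scheduling overhead in the fetcher threads.

```python
from datetime import datetime

# fetcher.server.delay as configured in the original post (seconds)
CONFIGURED_DELAY = 2.5

# Timestamps of the "fetching ..." lines in the quoted log
FETCH_STARTS = [
    "2015-11-11 20:56:25,981",
    "2015-11-11 20:56:30,791",
    "2015-11-11 20:56:35,960",
    "2015-11-11 20:56:40,552",
    "2015-11-11 20:56:45,128",
    "2015-11-11 20:56:49,967",
    "2015-11-11 20:56:54,746",
]


def gaps_seconds(stamps):
    """Return the time in seconds between consecutive log timestamps."""
    times = [datetime.strptime(s, "%Y-%m-%d %H:%M:%S,%f") for s in stamps]
    return [(b - a).total_seconds() for a, b in zip(times, times[1:])]


gaps = gaps_seconds(FETCH_STARTS)

# What is left of each gap once the configured sleep is subtracted:
# assumed here to be fetch time plus any other overhead.
non_delay = [round(g - CONFIGURED_DELAY, 3) for g in gaps]

for gap, rest in zip(gaps, non_delay):
    print(f"gap {gap:.3f}s = {CONFIGURED_DELAY}s delay + {rest:.3f}s fetch/overhead")
```

Every gap comes out larger than 2.5 sec., which is what you would expect if the delay is counted from the end of one fetch rather than from its start.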

