Varying Number of URLS Crawled.

Nagarjun Pola Thu, 12 Feb 2015 10:41:36 -0800

Hi Everyone,

I started to use Nutch 1.10 for my homework and I see that every time I
perform a crawl using the same configuration and same seed urls I get a
different number of fetched urls. This occurs even when the old crawl data
is deleted.


This way I would not be able to identify which URLs had a problem being
fetched and if it was resolved later or not.

Any suggestions on how to solve this issue would be of great help.

Thank You.

Best,
Nagarjun Pola
University of Southern California

Varying Number of URLS Crawled.

Reply via email to