same page fetched severals times in one crawl

Pierre Nogues Sat, 13 Oct 2012 10:08:00 -0700

Hello,

I'm using nutch 2.1 with mysql and when I do a simple "bin/nutch crawl seed/ 
-depth 5 -topN 10000", I noticed nutch fetch 3 or 4 times the same URL during 
the crawl, why ?


I just configured nutch to local crawl a website (restriction in 
regex-urlfilter), everything else looks ok on mysql.  

nuch-site.xml : http://pastebin.com/Mx9s5Kfz

same page fetched severals times in one crawl

Reply via email to