i have checked, but haven't discovered Why the outlinks were not fetched. In
fact, i want to crawl the links more round like that:
- The first round crawls only urls(R1) which are defined in seed list. This
round discovers the set of outlinks(Ro1).
- The second round crawls *only set Ro1*, it discovers the set of
outlinks(Ro2).
....
Is there a script or configuration to do that ?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Iterative-Crawling-tp4046501p4047571.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to