i have checked, but haven't discovered Why the outlinks were not fetched. In fact, i want to crawl the links more round like that: - The first round crawls only urls(R1) which are defined in seed list. This round discovers the set of outlinks(Ro1). - The second round crawls *only set Ro1*, it discovers the set of outlinks(Ro2). .... Is there a script or configuration to do that ?
-- View this message in context: http://lucene.472066.n3.nabble.com/Iterative-Crawling-tp4046501p4047571.html Sent from the Nutch - User mailing list archive at Nabble.com.

