Hello:
 
 
 
  I am using nutch1.2 to crawl web pages ,and encounter a problem .In the class 
of Generator ,I found a argument which called adddays , and in the wiki I found 
it used for recrawl. The default value of “db.fetch.interval.default” is 30 
days ,so I set the adddays 31.I think this could be OK .But when I run the 
crawl task a few of time ,I found that when I set the depth to 30 it could run 
generate and fetch task 30 times .In fact the max of depth is only 12.
 
And every segment file size is almost the same .I read the source code but 
could not fount what’s wrong with it, so I need some help from you . 
 
 
 
Thank you!

Reply via email to