subject:"How come I have so many retries listed in stats\?"

Re: How come I have so many retries listed in stats?

2010-01-10 Thread Julien Nioche

Hi Jesse, The number of retries is encoded in a byte so I suppose that it turns into negative values when it goes beyond 128. As for* db.fetch.retry.max* it is used only in the crawlDB reducer to mark a URL as gone after x unsuccessful attempts. From what I understand URLs marked as gone just get

How come I have so many retries listed in stats?

2010-01-09 Thread Jesse Hires

In nutch-default.xml I have the following db.fetch.retry.max 3 The maximum number of times a url that has encountered recoverable errors is generated for fetch. Yet after letting things run for some time, if I look at the stats I have the following... Is there some other setting I shoul