Emmanuel wrote:
Hi Guys,

We maintain a counter that tracks how many consecutive times a URL has been in
the RETRY state because of trouble fetching the page.

Here is a sample of the code:

case ProtocolStatus.RETRY:          // retry
    fit.datum.setRetriesSinceFetch(fit.datum.getRetriesSinceFetch() + 1);

However, I notice that we don't reset this counter to 0 after a
successful fetch.

Is there a reason or some logic that I don't understand? Or should we reset
the counter every time we hit "case ProtocolStatus.SUCCESS:"?

Sigh ... looks like a bug. Good catch! I think the resetting of the retry counter was omitted in one of the patches that refactored the CrawlDbReducer. I clearly remember it was there some time ago ...
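
For illustration only, here is a minimal sketch of what resetting the counter on success could look like. It is not the actual Nutch code: the Datum class and the Status enum are hypothetical stand-ins for CrawlDatum and ProtocolStatus, and update() is an invented helper. Only the getRetriesSinceFetch/setRetriesSinceFetch names come from the snippet quoted above.

```java
public class RetryCounterSketch {
    // Hypothetical stand-in for the ProtocolStatus codes; only the two cases discussed here.
    enum Status { SUCCESS, RETRY }

    // Hypothetical stand-in for CrawlDatum that keeps only the retry counter.
    static class Datum {
        private int retriesSinceFetch;

        int getRetriesSinceFetch() { return retriesSinceFetch; }

        void setRetriesSinceFetch(int retries) { retriesSinceFetch = retries; }
    }

    // Invented helper: applies the outcome of one fetch attempt to the counter.
    static void update(Datum datum, Status status) {
        switch (status) {
            case RETRY:
                // Another consecutive failed attempt: increment the counter.
                datum.setRetriesSinceFetch(datum.getRetriesSinceFetch() + 1);
                break;
            case SUCCESS:
                // The fetch worked, so the run of consecutive retries ends here.
                datum.setRetriesSinceFetch(0);
                break;
        }
    }

    public static void main(String[] args) {
        Datum datum = new Datum();
        update(datum, Status.RETRY);
        update(datum, Status.RETRY);
        System.out.println(datum.getRetriesSinceFetch()); // prints 2
        update(datum, Status.SUCCESS);
        System.out.println(datum.getRetriesSinceFetch()); // prints 0
    }
}
```

Without the SUCCESS branch the counter would keep growing across unrelated failures, so a URL that fails only occasionally could eventually look like one that fails every time.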




--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com
