Emmanuel wrote:
Hi guys,

We keep a counter of how many consecutive times a URL has been in the Retry state after trouble fetching the page. Here is a sample of the code:

    case ProtocolStatus.RETRY: // retry
      fit.datum.setRetriesSinceFetch(fit.datum.getRetriesSinceFetch() + 1);

However, I notice that we don't reset this counter to 0 after a successful fetch. Is there a reason or some logic that I don't understand? Or should we reset the counter every time we hit "case ProtocolStatus.SUCCESS:"?

Sigh ... looks like a bug. Good catch! I think that the re-setting of the retry counter was omitted in one of the patches that refactored the CrawlDbReducer - I clearly remember it was there some time ago ...
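
For illustration, here is a minimal, self-contained sketch of the reset being discussed. It is not the actual Nutch code or the eventual patch: RetryCounterSketch, Datum, applyStatus and the two status constants are hypothetical stand-ins, and only the getRetriesSinceFetch()/setRetriesSinceFetch() accessor names are taken from the snippet quoted above.

```java
// Stand-ins only: this models just enough of the datum and status codes
// to show the counter logic. It is not Nutch's real class layout.
final class RetryCounterSketch {

    // Hypothetical status codes; the real constants live in ProtocolStatus.
    static final int SUCCESS = 1;
    static final int RETRY = 2;

    // Minimal stand-in for the datum that carries the retry counter.
    static final class Datum {
        private int retriesSinceFetch;

        int getRetriesSinceFetch() { return retriesSinceFetch; }

        void setRetriesSinceFetch(int retries) { retriesSinceFetch = retries; }
    }

    // Updates the counter according to the outcome of one fetch attempt.
    static void applyStatus(Datum datum, int status) {
        switch (status) {
            case RETRY:
                // Another consecutive failure: count it.
                datum.setRetriesSinceFetch(datum.getRetriesSinceFetch() + 1);
                break;
            case SUCCESS:
                // The page was fetched, so the run of retries is over.
                datum.setRetriesSinceFetch(0);
                break;
            default:
                // Other statuses leave the counter untouched in this sketch.
                break;
        }
    }

    public static void main(String[] args) {
        Datum datum = new Datum();
        applyStatus(datum, RETRY);
        applyStatus(datum, RETRY);
        System.out.println("after two retries: " + datum.getRetriesSinceFetch());
        applyStatus(datum, SUCCESS);
        System.out.println("after success: " + datum.getRetriesSinceFetch());
    }
}
```

Without the SUCCESS branch, a URL that fails intermittently would keep accumulating retries across otherwise successful fetches, so the counter would no longer mean "consecutive" failures.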

-- 
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com
