Hi IT_ailen:
       I know what 404 means, and I also know about the adaptive fetch schedule.
What I want to know is what Nutch does when it meets exceptions during a recrawl.
As an example, suppose a page was fetched successfully once and then recrawled
three times, and all three recrawls returned 404 or some other
exception. Will Nutch use the information from the failed fetches to overwrite
the record of the earlier successful fetch?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/What-is-the-Nutch-page-update-mechanism-after-recrawl-tp4002366p4002373.html
Sent from the Nutch - User mailing list archive at Nabble.com.
