I had a large fetch abort, and I'd like to recover the fetched pages (rather
than restarting from the beginning). 

This must be a fairly common need, but after a long search, I haven't been
able to find a clear answer.  I have followed the instructions found here
(http://wiki.apache.org/nutch/FAQ#How_can_I_recover_an_aborted_fetch_process.3F),
but when I run updatedb, the segment that was aborted is skipped. 

With Nutch 1.1, how can I recover the pages that have already been fetched?  

I would be most grateful to anyone who can point me at a resource with the
answer or who could provide the steps here. 

Thanks for your help!

Hal
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Aborted-Fetch-Recovering-Fetched-Pages-Nutch-1-1-tp992947p992947.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to