Doug,
1. Use the aborted output. You'll need to touch the file fetcher.done in the segment directory. All the pages that were not crawled will be re-generated for fetch pretty soon. If you fetched lots of pages, and don't want to have to re-fetch them again, this is the best way.

Just as a not i do not have a fetcher.done file:

:/projects/nutch-0.4-dev joa$ find . -name fetch* -print
./docs/api/net/nutch/fetcher
./segments/20040518024340/fetcher
./segments/20040518024340/fetcher_content
./segments/20040518024340/fetcher_text
./segments/20040518024340/fetchlist

Bo problem for me since i just test and play around but may be good to know that this file is not there after a strg + c abort.

However i post your answer to the wiki/faq

Stefan



-------------------------------------------------------
This SF.Net email is sponsored by: SourceForge.net Broadband
Sign-up now for SourceForge Broadband and get the fastest
6.0/768 connection for only $19.95/mo for the first 3 months!
http://ads.osdn.com/?ad_id=2562&alloc_id=6184&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to