Thanks may be we should copy paste that in the wiki.
If I'm allowed i will do that.


Am 18.05.2004 um 21:24 schrieb Doug Cutting:

You have two choices:

1. Use the aborted output. You'll need to touch the file fetcher.done in the segment directory. All the pages that were not crawled will be re-generated for fetch pretty soon. If you fetched lots of pages, and don't want to have to re-fetch them again, this is the best way.

2. Discard the aborted output. To do this, just delete the fetcher* directories in the segment and restart the fetcher.

Doug

Stefan Groschupf wrote:
Hi,
hmm sorry, shame on me I do not know how to recover a fetch process.
I had abort the fetch process and wish to continue now and get this exception:
Exception in thread "main" java.io.IOException: already exists: segments/20040518024340/fetcher
at net.nutch.io.MapFile$Writer.<init>(MapFile.java:66)
at net.nutch.io.MapFile$Writer.<init>(MapFile.java:55)
at net.nutch.io.ArrayFile$Writer.<init>(ArrayFile.java:19)
at net.nutch.fetcher.RequestScheduler.main(RequestScheduler.java:1427)
Is there any chance to recove? I would write a small tool if necessary to recover the data, but where to start and what is to do?
Thanks for any hints,
Stefan
---------------------------------------------------------------
open technology: http://www.media-style.com
open source: http://www.weta-group.net
open discussion: http://www.text-mining.org
-------------------------------------------------------
This SF.Net email is sponsored by: SourceForge.net Broadband
Sign-up now for SourceForge Broadband and get the fastest
6.0/768 connection for only $19.95/mo for the first 3 months!
http://ads.osdn.com/?ad_id=2562&alloc_id=6184&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers


-------------------------------------------------------
This SF.Net email is sponsored by: SourceForge.net Broadband
Sign-up now for SourceForge Broadband and get the fastest
6.0/768 connection for only $19.95/mo for the first 3 months!
http://ads.osdn.com/?ad_id=2562&alloc_id=6184&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers


---------------------------------------------------------------
open technology:   http://www.media-style.com
open source:           http://www.weta-group.net
open discussion:    http://www.text-mining.org



-------------------------------------------------------
This SF.Net email is sponsored by: SourceForge.net Broadband
Sign-up now for SourceForge Broadband and get the fastest
6.0/768 connection for only $19.95/mo for the first 3 months!
http://ads.osdn.com/?ad_id=2562&alloc_id=6184&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to