Hi Andrzej,

Thanks for the tool!

I found one 'map_xxxxxx' directory which matches the date my segment was created. It contains a 'part-0.out' file with a timestamp that matches the time of the last entries in my log file (just before the process stopped).

I followed the preparation steps and ran the tool. However, I got the following error:

2007-02-27 11:27:00,416 WARN mapred.LocalJobRunner (LocalJobRunner.java:run(120)) - job_sygdrx
java.io.IOException: wrong value class: is not class org.apache.nutch.fetcher.FetcherOutput
        at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:346)
        at org.apache.hadoop.mapred.SequenceFileRecordReader.next(SequenceFileRecordReader.java:58)
        at org.apache.hadoop.mapred.MapTask$3.next(MapTask.java:119)
        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:46)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:129)
        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:91)
2007-02-27 11:27:01,321 WARN util.ToolBase (ToolBase.java:doMain(185)) - Job failed!

By looking at the Hadoop sources I noticed that the value class mentioned in this error message is determined by the SequenceFile reader and obtained from the sequence file itself, which, I think, indicates that the part-00000 file I use as input for the tool does indeed contain FetcherOutput object(s). I get the same error when I remove the following line from LocalFetcherRecover.java:
job.setOutputValueClass(FetcherOutput.class);
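For what it's worth, the classes recorded in the file header can be checked directly with a small standalone program. This is only a sketch: it assumes the Hadoop 0.8-era jars are on the classpath and takes the path to the part file as its (hypothetical) first argument.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.SequenceFile;

// Prints the key/value classes recorded in a SequenceFile header,
// so they can be compared against what the job config expects.
public class DumpSeqFileClasses {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.getLocal(conf);
        SequenceFile.Reader reader =
            new SequenceFile.Reader(fs, new Path(args[0]), conf);
        try {
            System.out.println("key class:   " + reader.getKeyClass().getName());
            System.out.println("value class: " + reader.getValueClass().getName());
        } finally {
            reader.close();
        }
    }
}
```

If the value class printed here is not FetcherOutput, that would explain the mismatch regardless of what the job config declares.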

Any clues?
Btw, we use Nutch 0.8.1.

Thanks,
Mathijs





Andrzej Bialecki wrote:
Mathijs Homminga wrote:
Hi Andrzej,

The job stopped because there was no space left on the disk:

FATAL fetcher.Fetcher - org.apache.hadoop.fs.FSError: java.io.IOException: No space left on device
FATAL fetcher.Fetcher -         at org.apache.hadoop.fs.LocalFileSystem$LocalFSFileOutputStream.write(LocalFileSystem.java:150)
FATAL fetcher.Fetcher -         at org.apache.hadoop.fs.FSDataOutputStream$PositionCache.write(FSDataOutputStream.java:112)

We use a local FS. Temporary data is stored in /tmp/hadoop/mapred/


Ok, in your case this partial data may be recoverable, but with some manual work involved ...

At this stage, I'm assuming that even if you started the reduce phase, its output won't be usable at all. So we need to start from the data contained in the partial map outputs. Map outputs are a set of SequenceFiles containing pairs of <Text, FetcherOutput> data. Umm, forgot to ask you - are you running trunk/ or Nutch 0.8? If trunk, use the Text class; if 0.8, replace all occurrences of Text with UTF8.
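To get a rough idea of how much of a partial map output is still readable, a salvage loop like the following might help. Again, this is only a sketch, not part of the recovery tool itself: it assumes Nutch 0.8 on the classpath (hence UTF8 keys), and the path argument is hypothetical. It reads pairs until the file runs out or the reader hits the truncated tail left by the aborted job.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.SequenceFile;
import org.apache.hadoop.io.UTF8;
import org.apache.nutch.fetcher.FetcherOutput;

// Counts how many <UTF8, FetcherOutput> records can still be read
// from a partial map output before the truncated tail is reached.
public class CountRecoverableRecords {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.getLocal(conf);
        SequenceFile.Reader reader =
            new SequenceFile.Reader(fs, new Path(args[0]), conf);
        UTF8 key = new UTF8();
        FetcherOutput value = new FetcherOutput();
        long count = 0;
        try {
            while (reader.next(key, value)) {
                count++;
            }
        } catch (java.io.IOException e) {
            // Expected at the point where the aborted job stopped writing.
            System.out.println("hit unreadable tail after " + count + " records: " + e);
        } finally {
            reader.close();
        }
        System.out.println("recoverable records: " + count);
    }
}
```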

This is such a common problem that I created a special tool to address this - please see http://issues.apache.org/jira/browse/NUTCH-451 .

Let me repeat what the javadoc says, so that there's no misunderstanding: if you use DFS and your fetch job is aborted, there is no way in the world to recover the data - it's permanently lost. If you run with a local FS, you can try this tool and hope for the best.

