This is related to HADOOP-964. Replace the Hadoop jar files in your Nutch
installation with the most recent versions from Hadoop. You will also need
to apply the NUTCH-437 patch to get Nutch to work with the most recent
changes to the Hadoop codebase.
Dennis Kubes
Gal Nitzan wrote:
Hi,
Does anybody use Nutch trunk?
I am running Nutch 0.9 and am unable to fetch.
After 50-60K URLs I get an NPE in
org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue every time.
I was wondering if anyone has a workaround, or whether something is wrong with
my setup.
I have opened a new issue in JIRA
http://issues.apache.org/jira/browse/hadoop-1008 for this.
Any clue?
Gal