Hi,

Using nutch 0.9, although I get the same with a more recent nightly build.

I'm getting NPE fetching these two pages:

http://www.absoluteit.co.nz
and
http://defence.allmedia.co.nz

I've tracked it down by putting a t.printStackTrace() in the catch (Throwable t) of the run() in Fetcher.java:
java.lang.NullPointerException
        at org.apache.hadoop.io.Text.encode(Text.java:375)
        at org.apache.hadoop.io.Text.encode(Text.java:356)
        at org.apache.hadoop.io.Text.writeString(Text.java:396)
at org.apache.nutch.protocol.Content.writeCompressed(Content.java:146) at org.apache.hadoop.io.CompressedWritable.write(CompressedWritable.java:74) at org.apache.nutch.fetcher.FetcherOutput.write(FetcherOutput.java:56) at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:315) at org.apache.nutch.fetcher.Fetcher$FetcherThread.output(Fetcher.java:343) at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:191)

I'm not sure where to go from here. Any suggestions?

Cheers,
Carl.

Reply via email to