Here you go.

java.lang.ClassCastException: java.util.ArrayList
       at org.apache.nutch.parse.ParseData.write(ParseData.java:122)
       at org.apache.nutch.parse.ParseImpl.write(ParseImpl.java:51)
at org.apache.nutch.fetcher.FetcherOutput.write(FetcherOutput.java:57) at org.apache.nutch.io.SequenceFile$Writer.append(SequenceFile.java:168)
       at org.apache.nutch.mapred.MapTask$1.collect(MapTask.java:78)
at org.apache.nutch.fetcher.Fetcher$FetcherThread.output(Fetcher.java:229) at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:123)

Andrzej Bialecki wrote:

Matt Zytaruk wrote:

The newest src (as of this morning) of trunk is occaisionally giving ClassCastExceptions when doing a crawl, with parsing (and by occaisionally I mean this was the only page out of the small list I crawled that it happened on). This is with the nothing changed from the defaults and on a server running Suse linux. Here is a sample of the logging:

060106 111516 Parsing [http://easily.co.uk/] with [EMAIL PROTECTED]
060106 111516 Using Signature impl: org.apache.nutch.crawl.MD5Signature
060106 111516 fetch of http://easily.co.uk/ failed with: java.lang.ClassCastException: java.util.ArrayList

-Matt Zytaruk



Could you please add a call to printStackTrace() in that catch{} statement, so that we know where the exception is thrown?


Reply via email to