Here you go.
java.lang.ClassCastException: java.util.ArrayList
    at org.apache.nutch.parse.ParseData.write(ParseData.java:122)
    at org.apache.nutch.parse.ParseImpl.write(ParseImpl.java:51)
    at org.apache.nutch.fetcher.FetcherOutput.write(FetcherOutput.java:57)
    at org.apache.nutch.io.SequenceFile$Writer.append(SequenceFile.java:168)
    at org.apache.nutch.mapred.MapTask$1.collect(MapTask.java:78)
    at org.apache.nutch.fetcher.Fetcher$FetcherThread.output(Fetcher.java:229)
    at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:123)
Andrzej Bialecki wrote:
Matt Zytaruk wrote:
The newest src (as of this morning) of trunk is occasionally giving
ClassCastExceptions when doing a crawl with parsing (and by
occasionally I mean this was the only page out of the small list I
crawled that it happened on). This is with nothing changed from
the defaults, on a server running SUSE Linux. Here is a sample of
the logging:
060106 111516 Parsing [http://easily.co.uk/] with [EMAIL PROTECTED]
060106 111516 Using Signature impl: org.apache.nutch.crawl.MD5Signature
060106 111516 fetch of http://easily.co.uk/ failed with:
java.lang.ClassCastException: java.util.ArrayList
-Matt Zytaruk
Could you please add a call to printStackTrace() in that catch{}
statement, so that we know where the exception is thrown?
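
For anyone following along, here is a minimal sketch of what that change might look like. It is not the actual Fetcher code: the class StackTraceSketch and the methods writeRecord and describe are hypothetical stand-ins, and the bad cast is only there to provoke a ClassCastException on an ArrayList. The point is the extra printStackTrace() call next to the existing one-line message.

```java
import java.io.PrintWriter;
import java.io.StringWriter;
import java.util.ArrayList;

public class StackTraceSketch {

    // Hypothetical stand-in for the per-page work done in a fetcher thread.
    static void writeRecord(Object metadata) {
        // Deliberately wrong cast: the object is really an ArrayList,
        // so this line throws a ClassCastException at run time.
        String value = (String) metadata;
        System.out.println(value);
    }

    // Renders the full stack trace as text, in case it should go to a
    // log file rather than straight to stderr.
    static String describe(Throwable t) {
        StringWriter buffer = new StringWriter();
        t.printStackTrace(new PrintWriter(buffer, true));
        return buffer.toString();
    }

    public static void main(String[] args) {
        String url = "http://easily.co.uk/";
        try {
            writeRecord(new ArrayList<String>());
        } catch (Throwable t) {
            // Existing style of message: names the exception but not the frame.
            System.err.println("fetch of " + url + " failed with: " + t);
            // Added call: prints every frame, so the throwing line is visible.
            t.printStackTrace();
        }
    }
}
```

If the output has to go through a logger instead of stderr, describe(t) gives the same trace as a String.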