I've got latest versions of nutch (0.9-dev) and hadoop (Trunk) from svn.
When I try to index I get the next error:

java.lang.ClassCastException: org.apache.nutch.parse.ParseData
     at org.apache.nutch.indexer.Indexer$InputFormat$1.next(Indexer.java:92)
     at org.apache.hadoop.mapred.MapTask$3.next(MapTask.java:184)
     at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:44)
     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:196)
     at
org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:1075)

 
This exception is raised from method next(Writable key, Writable value) of
class SequenceFileRecordReader. 

Method 'next' is called with 'value' parameter that have different class for
each its call (classes are crawlDatum, ParseData or Inlinks). 

And when these classes (crawlDatum, ParseData or Inlinks) are cast I get
classCastException.

Why do I get this exception? I looked at old sources but didn't find
distinctions in algorithm. What do I miss?



-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to