Hi, I just started using Mahout a week or two ago and so far it's been pretty good. I am working on some term collocation, and while I have been working from a directory of files, I want to switch to using Lucene indexes, as that is the format the files are already in. I am trying to use lucene.vector to turn the indexes into vectors and then use org.apache.mahout.vectorizer.collocations.llr.CollocDriver to generate the collocations and LLRs. I keep getting this error when I run CollocDriver. Any ideas?

java.lang.ClassCastException: org.apache.hadoop.io.LongWritable cannot be cast to org.apache.hadoop.io.Text
    at org.apache.mahout.vectorizer.collocations.llr.CollocMapper.map(CollocMapper.java:40)
    at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
    at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:763)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:369)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:259)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
    at org.apache.hadoop.mapred.Child.main(Child.java:253)

--
Peter Andrews
