I am trying to run the RowSimilarityJob on a small hadoop cluster, and
it fails with the following message.
java.io.IOException: wrong value class:
org.apache.mahout.math.hadoop.similarity.WeightedOccurrence is not class
org.apache.mahout.math.hadoop.similarity.WeightedOccurrenceArray
at
org.apache.hadoop.io.SequenceFile$Writer.append(SequenceFile.java:988)
at
org.apache.hadoop.mapreduce.lib.output.SequenceFileOutputFormat$1.write(
SequenceFileOutputFormat.java:74)
at
org.apache.hadoop.mapred.MapTask$NewDirectOutputCollector.write(MapTask.
java:498)
at
org.apache.hadoop.mapreduce.TaskInputOutputContext.write(TaskInputOutput
Context.java:80)
at
org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$RowWeightMappe
r.map(RowSimilarityJob.java:222)
at
org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$RowWeightMappe
r.map(RowSimilarityJob.java:199)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:621)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
at org.apache.hadoop.mapred.Child.main(Child.java:170)
There is no issue when I run the same class in a local mode using "
mahout rowsimilarity" command with the same input file. I looked at
the source code, I could not see any type mismatches.
i am using version 0.5 of mahout.
Please help me to resolve this issue.
Thank you
Seby Paul