Hi,
I am able to dump vector for tfidf-vectors, tf-vectors files
Which gets generated during SparseVectorsFromSequenceFiles.
I am getting following error when I try to generate vectors from
Wordcount (files generated during SparseVectorsFromSequenceFiles )
Error -
Exception in thread "main" java.lang.ClassCastException:
org.apache.hadoop.io.LongWritable cannot be cast to
org.apache.mahout.math.VectorWritable
at
org.apache.mahout.utils.vectors.VectorDumper.main(VectorDumper.java:144)
tokenized-documents (files generated during SparseVectorsFromSequenceFiles )
Error -
Exception in thread "main" java.lang.ClassCastException:
org.apache.mahout.common.StringTuple cannot be cast to
org.apache.mahout.math.VectorWritable
at
org.apache.mahout.utils.vectors.VectorDumper.main(VectorDumper.java:144)
df-count (files generated during SparseVectorsFromSequenceFiles )
Error -
Exception in thread "main" java.lang.ClassCastException:
org.apache.hadoop.io.LongWritable cannot be cast to
org.apache.mahout.math.VectorWritable
at
org.apache.mahout.utils.vectors.VectorDumper.main(VectorDumper.java:144)
dictionary.file-0 (files generated during SparseVectorsFromSequenceFiles )
Error -
Exception in thread "main" java.lang.ClassCastException:
org.apache.hadoop.io.IntWritable cannot be cast to
org.apache.mahout.math.VectorWritable
at
org.apache.mahout.utils.vectors.VectorDumper.main(VectorDumper.java:144)
What am I doing wrong here ?
Regards,
Divya