Hi,

 

I am able to dump vector for tfidf-vectors, tf-vectors files 

Which gets generated during  SparseVectorsFromSequenceFiles.

 

I am getting following error when I try to generate vectors from 

 

 

Wordcount (files generated during SparseVectorsFromSequenceFiles )

 

Error -

Exception in thread "main" java.lang.ClassCastException:
org.apache.hadoop.io.LongWritable cannot be cast to
org.apache.mahout.math.VectorWritable

                at
org.apache.mahout.utils.vectors.VectorDumper.main(VectorDumper.java:144)

 

 

 

tokenized-documents (files generated during SparseVectorsFromSequenceFiles )

 

Error -

 

Exception in thread "main" java.lang.ClassCastException:
org.apache.mahout.common.StringTuple cannot be cast to
org.apache.mahout.math.VectorWritable

                at
org.apache.mahout.utils.vectors.VectorDumper.main(VectorDumper.java:144)

 

 

 

 

df-count (files generated during SparseVectorsFromSequenceFiles )

 

Error -

 

Exception in thread "main" java.lang.ClassCastException:
org.apache.hadoop.io.LongWritable cannot be cast to
org.apache.mahout.math.VectorWritable

                at
org.apache.mahout.utils.vectors.VectorDumper.main(VectorDumper.java:144)

 

 

 

dictionary.file-0 (files generated during SparseVectorsFromSequenceFiles )

 

Error -

 

Exception in thread "main" java.lang.ClassCastException:
org.apache.hadoop.io.IntWritable cannot be cast to
org.apache.mahout.math.VectorWritable

                at
org.apache.mahout.utils.vectors.VectorDumper.main(VectorDumper.java:144)

 

 

 

What am I doing wrong here ?

 

 

Regards,

Divya 

 

 

Reply via email to