Hi,

I'm trying to use Mahout's SVD with no success so far.
I converted my input  from CSV format using the attached class.
Then I run the following command
hadoop jar mahout-examples-0.3.job
org.apache.mahout.math.hadoop.decomposer.DistributedLanczosSolver
-i /hdfs/data/svd/user_doc_score -o  /hdfs/data/svd/svd-output -r 10 -nr
6040 -nc 3282 -sym 0

and get this error:
org.apache.mahout.math.CardinalityException: My cardinality is: 2147483647,
but the other is: 3282
        at org.apache.mahout.math.RandomAccessSparseVector.dot
(RandomAccessSparseVector.java:275)
        at org.apache.mahout.math.hadoop.TimesSquaredJob
$TimesSquaredMapper.scale(TimesSquaredJob.java:200)
        at org.apache.mahout.math.hadoop.TimesSquaredJob
$TimesSquaredMapper.map(TimesSquaredJob.java:191)
        at org.apache.mahout.math.hadoop.TimesSquaredJob
$TimesSquaredMapper.map(TimesSquaredJob.java:147)
        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
        at org.apache.hadoop.mapred.Child.main(Child.java:170)

Any ideas/suggestions?

Thanks,
Avishay

(See attached file: CSVtoSeq.java)

Reply via email to