All, 
  I am trying to test Stochastic SVD and am facing some errors where it would 
be great if  someone could clarifying what is going on. I am trying to feed the 
solver a DistributedRowMatrix with the exact same parameters that the test in  
LocalSSVDSolverSparseSequentialTest uses, i.e, Generate a 1000 X 100 DRM with 
SequentialSparseVectors and then ask for blockHeight 251, p (oversampling) = 
60, k (rank) = 40. I get the following error:

Exception in thread "main" java.io.IOException: Unexpected overrun in upper 
triangular matrix files
        at 
org.apache.mahout.math.hadoop.stochasticsvd.SSVDSolver.loadUpperTriangularMatrix(SSVDSolver.java:471)
        at 
org.apache.mahout.math.hadoop.stochasticsvd.SSVDSolver.run(SSVDSolver.java:268)
        at com.mozilla.SSVDCli.run(SSVDCli.java:89)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
        at com.mozilla.SSVDCli.main(SSVDCli.java:129)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
        at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.util.RunJar.main(RunJar.java:186)

Also, I am using CDH3 with Mahout recompiled to work with CDH3 jars.

Thanks
Esh

Reply via email to