Does anyone have guidelines on the heap size needed when running SVD?  I've done a 
couple of fairly long runs on my single machine and keep running out of memory 
fairly deep into the run.  Before I increase the heap size for the 4th time, I 
figured I'd see if it is even going to fit into memory at all.

My matrix is ~ 130,000 x 62,000 and I have 4GB total on my machine.  I'm 
running this locally for now as a first step in scaling it out.

Here's my command:

./mahout svd -Dmapred.input.dir=/tmp/solr-clust-n2/part-out.vec \
  --numCols 61892 --numRows 129444 --rank 1000 \
  --tempDir /tmp/solr-clust-n2-svd
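
For a rough sense of scale, here is a back-of-the-envelope sketch in Python using the dimensions from the command above. It assumes 8-byte doubles and fully dense storage, which is only an assumption: the input vectors are probably sparse, and I don't know how the solver actually holds its intermediate vectors. The helper name dense_bytes is made up for illustration.

```python
# Back-of-the-envelope memory estimate. Assumptions (not confirmed
# against the solver): 64-bit doubles, fully dense storage, no overhead.
BYTES_PER_DOUBLE = 8


def dense_bytes(rows: int, cols: int) -> int:
    """Bytes needed to hold a dense rows x cols matrix of doubles."""
    return rows * cols * BYTES_PER_DOUBLE


num_rows, num_cols, rank = 129444, 61892, 1000

# The whole input matrix, if it were stored densely.
full_matrix = dense_bytes(num_rows, num_cols)

# `rank` dense vectors of length num_cols, if all were kept at once.
rank_vectors = dense_bytes(rank, num_cols)

print(f"dense input matrix: {full_matrix / 2**30:.1f} GiB")
print(f"{rank} dense vectors of length {num_cols}: {rank_vectors / 2**20:.1f} MiB")
```

Under those assumptions the dense input matrix is far larger than 4GB, so it could only fit if it stays sparse. The set of rank-many dense vectors is a much smaller figure, but still a sizable fraction of the machine's memory if they are all held at once.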

Thanks,
Grant
