Are u using Mahout 0.7 ?

From this line in ur stacktrace that seems to be the case:
MAHOUT-JOB: /usr/lib/mahout/mahout-examples-0.7-cdh4.5.0-job.jar

You could build Mahout outside of CDH from Mahout trunk and put the jars onto 
CDH5.
I am no Cloudera expert or CDH5 user to help with CDHx build.







On Wednesday, March 5, 2014 9:30 AM, Kevin Moulart <kevinmoul...@gmail.com> 
wrote:
 
Hi and thanks for your help!

I had been told that the version of mahout used by Cloudera (CDH 4.6) was in 
fact 0.8 with a patch for mr2 support.
( 
http://mail-archives.apache.org/mod_mbox/mahout-user/201402.mbox/%3CCAEccTywqSAKA_HeX4vTZ-5XPmKtj5b8zMGQUfn5qRsiq=7o=u...@mail.gmail.com%3E
 )

But I tried to install 0.9 on my own, by compiling it with mvn after I changed 
the pom.xml :

- Added cloudera repository :


    <repository>
      <id>cloudera-repo</id>
      <name>Cloudera Repository</name>
      <url>https://repository.cloudera.com/artifactory/cloudera-repos</url>
    </repository>

- Changed the version of hadoop to use :
    <hadoop.1.version>2.0.0-mr1-cdh4.6.0</hadoop.1.version>

- I tried adding this one too :
    <hadoop2.version>2.0.0-cdh4.6.0</hadoop2.version>

But then I get a lot of errors when Maven begins to compile the core package :
https://gist.github.com/kmoulart/9368193


Could you tell me what I did wrong ?



2014-03-04 19:02 GMT+01:00 Suneel Marthi <suneel_mar...@yahoo.com>:

The -us option was fixed for Mahout 0.8, seems like u r using Mahout 0.7 which 
had this issue (from ur stacktrace, its apparent u r using Mahout 0.7).  Please 
upgrade to the latest mahout version.
>
>
>
>
>
>
>On Tuesday, March 4, 2014 8:54 AM, Kevin Moulart <kevinmoul...@gmail.com> 
>wrote:
>
>Hi,
>
>I'm trying to apply a PCA to reduce the dimension of a matrix of 1603
>columns and 100.000 to 30.000.000 lines using ssvd with the pca option, and
>I always get a StackOverflowError :
>
>Here is my command line :
>mahout ssvd -i /user/myUser/Echant100k -o /user/myUser/Echant/SVD100 -k 100
>-pca "true" -U "false" -V "false" -t 3 -ow
>
>I also tried to put "-us true" as mentionned in
>https://cwiki.apache.org/confluence/download/attachments/27832158/SSVD-CLI.pdf?version=18&modificationDate=1381347063000&api=v2but
>the option is not available anymore.
>
>The output of the previous command is :
>MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
>Running on hadoop, using /opt/cloudera/parcels/CDH/lib/hadoop/bin/hadoop
>and HADOOP_CONF_DIR=/etc/hadoop/conf
>MAHOUT-JOB: /usr/lib/mahout/mahout-examples-0.7-cdh4.5.0-job.jar
>14/03/04 14:45:16 INFO common.AbstractJob: Command line arguments:
>{--abtBlockHeight=[200000], --blockHeight=[10000], --broadcast=[true],
>--computeU=[false], --computeV=[false], --endPhase=[2147483647],
>--input=[/user/myUser/Echant100k], --minSplitSize=[-1],
>--outerProdBlockHeight=[30000], --output=[/user/myUser/Echant/SVD100],
>--oversampling=[15], --overwrite=null, --pca=[true], --powerIter=[0],
>--rank=[100], --reduceTasks=[3], --startPhase=[0], --tempDir=[temp],
>--uHalfSigma=[false], --vHalfSigma=[false]}
>Exception in thread "main" java.lang.StackOverflowError
>at
>org.apache.mahout.math.hadoop.MatrixColumnMeansJob.run(MatrixColumnMeansJob.java:55)
>at
>org.apache.mahout.math.hadoop.MatrixColumnMeansJob.run(MatrixColumnMeansJob.java:55)
>at
>org.apache.mahout.math.hadoop.MatrixColumnMeansJob.run(MatrixColumnMeansJob.java:55)
>...
>
>I search online and didn't find a solution to my problem.
>
>Can you help me ?
>
>Thanks in advance,
>
>--
>Kévin Moulart


-- 

Kévin Moulart
GSM France : +33 7 81 06 10 10
GSM Belgique : +32 473 85 23 85
Téléphone fixe : +32 2 771 88 45

Reply via email to