[ 
https://issues.apache.org/jira/browse/MAHOUT-680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13037702#comment-13037702
 ] 

Elmer Garduno commented on MAHOUT-680:
--------------------------------------

I think this issue is causing us some problems when running Mahout on a Hadoop 
cluster. 

In particular, when running seq2sparse I get the following exception: 

Error: java.lang.ClassNotFoundException: org.apache.mahout.math.function.ObjectIntProcedure
        at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
        at java.lang.Class.forName0(Native Method)
        at java.lang.Class.forName(Class.java:247)
        at org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:943)
        at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:994)
        at org.apache.hadoop.mapreduce.JobContext.getMapperClass(JobContext.java:212)
        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:601)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:322)
        at org.apache.hadoop.mapred.Child$4.run(Child.java:268)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:396)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1115)
        at org.apache.hadoop.mapred.Child.main(Child.java:262)
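
When chasing a ClassNotFoundException like the one above, it can help to check which jar actually ships the missing class. The following is only a rough sketch: the helper names class_to_entry and jar_has_class are made up for illustration, the jar path in the commented usage line is hypothetical, and it assumes unzip is installed.

```shell
#!/bin/sh
# Hypothetical helper: turn a fully qualified class name into the
# entry name it would have inside a jar.
class_to_entry() {
  printf '%s.class\n' "$(printf '%s' "$1" | tr '.' '/')"
}

# Hypothetical helper: succeed if the jar ($1) lists the class ($2).
# Assumes `unzip` is available; `jar tf` would work similarly.
jar_has_class() {
  unzip -l "$1" 2>/dev/null | grep -qF "$(class_to_entry "$2")"
}

# Show the entry name to look for.
class_to_entry org.apache.mahout.math.function.ObjectIntProcedure

# Example usage (the jar path is an assumption, adjust to your build):
# jar_has_class /path/to/mahout-math.jar \
#   org.apache.mahout.math.function.ObjectIntProcedure && echo found
```

Running jar_has_class over the jars that are shipped with the job shows whether the class is packaged at all, which narrows the problem down to packaging versus classpath setup.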

Commenting out this line fixes the problem:
 export HADOOP_CLASSPATH=$MAHOUT_CONF_DIR:${HADOOP_CLASSPATH}:$CLASSPATH
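
To see what that export ends up putting on the classpath, the composed value can be printed one entry per line. This is a minimal sketch, not a diagnosis of the exception: the paths are made-up example values, list_entries is a hypothetical helper, and HADOOP_CLASSPATH is assumed to be unset beforehand.

```shell
#!/bin/sh
# Illustrative values only; real paths depend on the installation.
MAHOUT_CONF_DIR=/opt/mahout/conf
HADOOP_CLASSPATH=
CLASSPATH=/opt/mahout/mahout-core.job

# Same composition as the export line in question.
COMPOSED="$MAHOUT_CONF_DIR:${HADOOP_CLASSPATH}:$CLASSPATH"

# Hypothetical helper: print one classpath entry per line.
# An empty entry shows up as a blank line.
list_entries() {
  printf '%s\n' "$1" | tr ':' '\n'
}

list_entries "$COMPOSED"
```

With these example values the second line of output is blank, because the previously empty HADOOP_CLASSPATH leaves an empty entry between the two colons.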

The Mahout version we are using was compiled from 
http://svn.apache.org/repos/asf/mahout/tags/mahout-0.5/





> Running the Hadoop script through bin/mahout to set up classpath
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-680
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-680
>             Project: Mahout
>          Issue Type: Improvement
>    Affects Versions: 0.4
>            Reporter: Frank Scholten
>            Priority: Minor
>             Fix For: 0.5
>
>         Attachments: MAHOUT-680.patch, MAHOUT-680.patch
>
>
> Added a patch that lets you run the $HADOOP_HOME/bin/hadoop command script 
> through the bin/mahout script.
> This way the Mahout script adds the Mahout classes to $HADOOP_CLASSPATH, 
> so you can view sequence files generated by Mahout jobs with
> bin/mahout hadoop fs -text <sequencefile>
> without having to specify Mahout classes manually or getting 
> ClassNotFoundExceptions.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
