[ 
https://issues.apache.org/jira/browse/MAHOUT-680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13037774#comment-13037774
 ] 

Frank Scholten commented on MAHOUT-680:
---------------------------------------

Elmer, I couldn't reproduce your exception.

I did the following:

* I checked out the 0.5 tag and did a mvn clean install -DskipTests=true
* I started a Whirr cluster with 1 namenode/jobtracker and 2 
datanodes/tasktrackers
* I cleared all my local MAHOUT_* and HADOOP_* environment variables and I 
pointed HADOOP_HOME at a local 0.20.2 Hadoop install and pointed 
HADOOP_CONF_DIR at the conf dir with hadoop-site.xml, generated by Whirr.

I then ran the following:

$ hadoop fs -mkdir input
$ hadoop fs -put README.txt input
$ bin/mahout seqdirectory --input input --output output --charset UTF-8
$ bin/mahout hadoop fs -cat output/chunk-0
$ bin/mahout seq2sparse --input output --output output-seq2sparse

This work without problems. I attached the jobtracker HTML page.

How did you run your jobs?

I'm currently scp'ing the entire mahout folder to the namenode, ssh to it, and 
run the job from there to see if that might be causing problems.

> Running the Hadoop script through bin/mahout to set up classpath
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-680
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-680
>             Project: Mahout
>          Issue Type: Improvement
>    Affects Versions: 0.4
>            Reporter: Frank Scholten
>            Priority: Minor
>             Fix For: 0.5
>
>         Attachments: MAHOUT-680.patch, MAHOUT-680.patch
>
>
> Added a patch which allows you to run the $HADOOP_HOME/bin/hadoop command 
> script through the bin/mahout script.
> This way the Mahout script adds the Mahout classes to the $HADOOP_CLASSPATH 
> so you can view sequencefiles generated by Mahout jobs with
> bin/mahout hadoop fs -text <sequencefile>
> without having to specify Mahout classes manually or getting 
> ClassNotFoundExceptions

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to