I had issues with random forests. I was testing the following example: https://cwiki.apache.org/MAHOUT/partial-implementation.html
When I run the following command: $HADOOP_HOME/bin/hadoop jar $MAHOUT_HOME/core/target/mahout-core-<VERSION>-job.jar org.apache.mahout.df.tools.Describe -p testdata/KDDTrain+.arff -f testdata/KDDTrain+.info -d N 3 C 2 N C 4 N C 8 N 2 C 19 N L I get a java.lang.NoClassDefFoundError: org/apache/mahout/math/Vector The same command works fine with mahout-0.3. Inside mahout-core-0.3.job you can find the Vector class inside org/apache/mahout/mahout/, but I can't seem to find it inside mahout-core-0.4-SNAPSHOT-job.jar am I missing something ? On Fri, Oct 15, 2010 at 3:38 AM, Drew Farris <[email protected]> wrote: > Hi Deneche, Grant, > > There is an issue on jira related to this ( > https://issues.apache.org/jira/browse/MAHOUT-505) > > The long and short of it is that nexus has problems with the way we were > deploying artifacts that would prevent the jars for projects that produced > job jars being deployed correctly. The job jar would be available when > searching, but not the regular jar file. One way to work around this is to > move from *.job to -job.jar > > This also allows us to use the maven assembly mechanism to build the job > jars instead of using the ant build and maven build helper plug-in. There's > nothing wrong with the approach pre-505, the post-505 approach just achieves > the same goal with less configuration. > > As far as the Vector classes, the mahout-math jar is in the lib directory of > the new job jars and thus available on the classpath when jobs are run using > hadoop. > > Have you run into any issues using these new job jars? I've tested with the > build-reuters.sh script and run bayes training haven't experienced any > problems. > > Drew > > > On Oct 14, 2010 12:57 PM, "Grant Ingersoll" <[email protected]> wrote: >
