Hi all I guest I must've seen somewhere on very similar topics on classname change in Mahout-0.8-SNAPSHOT for some of the Lucene analyzer and here is another one that I need to be solved. Mahout gave me an error for seq2sparse with Lucene analyzer option as follows, which of cource had been working in at least Mahout 0.7.
$MAHOUT_HOME/bin/mahout seq2sparse --namedVector -i NHTSA-seqfile01/ -o NHTSA-namedVector -ow -a org.apache.lucene.analysis.WhitespaceAnalyzer -chunk 200 -wt tfidf -s 5 -md 3 -x 90 -ng 2 -ml 50 -seq -n 2 Running on hadoop, using /usr/local/hadoop/bin/hadoop and HADOOP_CONF_DIR= MAHOUT-JOB: /usr/local/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar 13/05/07 15:41:12 INFO vectorizer.SparseVectorsFromSequenceFiles: Maximum n-gram size is: 2 13/05/07 15:41:18 INFO vectorizer.SparseVectorsFromSequenceFiles: Minimum LLR value: 50.0 13/05/07 15:41:18 INFO vectorizer.SparseVectorsFromSequenceFiles: Number of reduce tasks: 1 Exception in thread "main" java.lang.ClassNotFoundException: org.apache.lucene.analysis.WhitespaceAnalyzer I have confirmed what classpath Mahout is refering to as; $ $MAHOUT_HOME/bin/mahout classpath and obtained Lucene related classpath as below. /usr/local/trunk/examples/target/dependency/lucene-analyzers-common-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-benchmark-4.2.1.jar: /usr/local/trunk/examples/target/dependency/lucene-core-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-facet-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-highlighter-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-memory-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-queries-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-queryparser-4.2.1.jar /usr/local/trunk/examples/target/dependency/lucene-sandbox-4.2.1.jar I want to believe this to be simple classname change related issue. Please let me be advised. Regards,,, Y.Mandai
