Hi, I've done a mnv clean install on the latest trunk and I'm getting the following error when trying to run seqdirectory example in MIA.
~/dev/mahout$ bin/mahout seqdirectory -core -c UTF-8 -i examples/reuters-extracted/ -o reuters-seqfiles Running on hadoop, using HADOOP_HOME=/home/riddelln/dev/hadoop-0.20.203.0 HADOOP_CONF_DIR=/home/riddelln/dev/hadoop-0.20.203.0/conf MAHOUT-JOB: /home/riddelln/dev/mahout/examples/target/mahout-examples-0.6-SNAPSHOT-job.jar 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.clustering.ClusterDumper 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.bayes.WikipediaDatasetCreatorDriver 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainLogistic 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.lucene.Driver 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunAdaptiveLogistic 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SequenceFileDumper 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.bayes.WikipediaXmlSplitter 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.PrintResourceOrFile 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.ValidateAdaptiveLogistic 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.WikipediaToSequenceFile 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainAdaptiveLogistic 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.VectorDumper 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.RowIdJob 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.lda.LDAPrintTopics 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromDirectory 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunLogistic 11/08/12 08:07:06 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.arff.Driver 11/08/12 08:07:06 WARN driver.MahoutDriver: No seqdirectory.props found on classpath, will use command-line arguments only Unknown program 'seqdirectory' chosen. Valid program names are: baumwelch: : Baum-Welch algorithm for unsupervised HMM training canopy: : Canopy clustering cleansvd: : Cleanup and verification of SVD output dirichlet: : Dirichlet Clustering eigencuts: : Eigencuts spectral clustering evaluateFactorization: : compute RMSE of a rating matrix factorization against probes in memory evaluateFactorizationParallel: : compute RMSE of a rating matrix factorization against probes fkmeans: : Fuzzy K-means clustering fpg: : Frequent Pattern Growth hmmpredict: : Generate random sequence of observations by given HMM itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering kmeans: : K-means clustering lda: : Latent Dirchlet Allocation matrixmult: : Take the product of two matrices meanshift: : Mean Shift clustering parallelALS: : ALS-WR factorization of a rating matrix predictFromFactorization: : predict preferences from a factorization of a rating matrix recommenditembased: : Compute recommendations using item-based collaborative filtering rowsimilarity: : Compute the pairwise similarities of the rows of a matrix seq2sparse: : Sparse Vector generation from Text sequence files spectralkmeans: : Spectral k-means clustering splitDataset: : split a rating dataset into training and probe parts ssvd: : Stochastic SVD svd: : Lanczos Singular Value Decomposition testclassifier: : Test Bayes Classifier trainclassifier: : Train Bayes Classifier transpose: : Take the transpose of a matrix vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors viterbi: : Viterbi decoding of hidden states from given output states sequence Any thoughts, please? Cheers Niall Riddell xSpace Analytics Ltd T: +44 161 408 3830 M:+44 778 696 3830 Skype: niall.riddell
