I took a look at the code, and the only required input seemed to be the sequence file. I pulled the part-00000 file from the output/clusterPoints/ folder on my Hadoop cluster and tried the following command:

bin/mahout clusterdump --seqFileDir part-00000 --output testFile.txt

This resulted in the following error:

no HADOOP_CONF_DIR or HADOOP_HOME set, running locally
Apr 9, 2010 3:00:25 PM org.slf4j.impl.JCLLoggerAdapter error
SEVERE: MahoutDriver failed with args: [--seqFileDir, part-00000, --output, testFile.txt, null]
null
Exception in thread "main" java.lang.NullPointerException
    at org.apache.mahout.utils.clustering.ClusterDumper.printClusters(ClusterDumper.java:121)
    at org.apache.mahout.utils.clustering.ClusterDumper.main(ClusterDumper.java:302)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
    at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
    at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:172)

Strange, since $HADOOP_HOME is actually set. Thoughts?

Jeff Eastman wrote:
>
> The dictionary file contains a list (not sure how its delimited) of
> element names for the input Vectors and is optional. See the new code in
> trunk/utils in TestClusterDumper for some examples. I need to write tests
> for meanshift and also fuzzy kmeans to make sure they work but I
> imagine they do. I also need to write tests that include the points, but
> that appears to be done in memory so it likely won't scale to your
> 5-node data set.
>
> Jeff
>
> adam35413 wrote:
>> I have been able to successfully run the kmean and meanshift examples on
>> a 5-node Hadoop cluster. However, when it comes to dealing with the
>> output, I am a bit confused. I found the following page:
>> http://cwiki.apache.org/MAHOUT/viewing-results.html, but when I went to
>> track down the dictionary file I was unable to find it. Do I need to
>> generate the dictionary file separately or manually?
>>
>> Thanks!
>>
>
>

--
View this message in context: http://n3.nabble.com/Dealing-with-kmean-and-meanshift-output-tp708824p709022.html
Sent from the Mahout User List mailing list archive at Nabble.com.
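
A possible first check for the "no HADOOP_CONF_DIR or HADOOP_HOME set" message above: the wording names two variables, and a variable that is set in the current shell but not exported is invisible to child processes such as the bin/mahout script. The sketch below assumes a POSIX shell with printenv available; check_exported is a hypothetical helper name, not part of Mahout or Hadoop. It only reports what a child process would see, and the trace above does not establish whether this is related to the NullPointerException.

```shell
#!/bin/sh
# check_exported is a hypothetical helper (not part of Mahout or Hadoop).
# It reports whether a variable is visible to child processes.
# printenv only sees the exported environment, so a variable that is
# set but not exported is reported the same way as an unset one.
check_exported() {
    name="$1"
    if printenv "$name" >/dev/null 2>&1; then
        echo "$name: exported"
    else
        echo "$name: not exported or unset"
    fi
}

# The two variables named in the message printed before the stack trace.
check_exported HADOOP_HOME
check_exported HADOOP_CONF_DIR
```

If either line reports "not exported or unset", running `export HADOOP_HOME` (or assigning and exporting HADOOP_CONF_DIR) in the same shell before invoking bin/mahout would make the variable visible to the script.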