Hi, Grant
I just install 0.3, but there is still some problem on dumping the cluser
from output files.
What I did detailly is following:
hadoop jar mahout-examples-0.3-SNAPSHOT.job
org.apache.mahout.clustering.syntheticcontrol.kmeans.Job -i testdata
the output is
[...@master target]$ hadoop fs -ls output
Found 13 items
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:35
/user/hadoop/output/canopies
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:35
/user/hadoop/output/clusters-0
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:35
/user/hadoop/output/clusters-1
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:36
/user/hadoop/output/clusters-2
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:36
/user/hadoop/output/clusters-3
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:37
/user/hadoop/output/clusters-4
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:37
/user/hadoop/output/clusters-5
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:37
/user/hadoop/output/clusters-6
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:38
/user/hadoop/output/clusters-7
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:38
/user/hadoop/output/clusters-8
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:39
/user/hadoop/output/clusters-9
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:34
/user/hadoop/output/data
drwxr-xr-x - hadoop supergroup 0 2010-02-22 17:39
/user/hadoop/output/points
then I run : sh /data/hadoop/mahout/trunk/bin/mahout clusterdump -s
clusters-9/ -p points/
Input Path: /data/hadoop/mahout/trunk/examples/target/clusters-9/part-00000
Exception in thread "main" java.lang.NullPointerException
at
org.apache.mahout.utils.vectors.VectorHelper.vectorToString(VectorHelper.java:69)
at
org.apache.mahout.utils.clustering.ClusterDumper.printClusters(ClusterDumper.java:133)
at
org.apache.mahout.utils.clustering.ClusterDumper.main(ClusterDumper.java:302)
and I run : sh /data/hadoop/mahout/trunk/bin/mahout clusterdump -s
testdata/synthetic_control.data -p points/
Error message is :
Exception in thread "main" java.lang.NullPointerException
at
org.apache.mahout.utils.clustering.ClusterDumper.printClusters(ClusterDumper.java:121)
at
org.apache.mahout.utils.clustering.ClusterDumper.main(ClusterDumper.java:302)
Is there something wrong?
Thanks!!
2010/2/19 Grant Ingersoll <[email protected]>
>
> On Feb 19, 2010, at 8:33 AM, Cui tony wrote:
>
> > Thank you all you guys.
> > I know how to change seqFile to txt file now.
> >
> > I'm sorry, Grant, your example is still a little complicated to me. How
> can
> > I run in this command: bin/mahout ?
> > This is the command line which I used :
> > hadoop jar data/mahout-examples-0.2.job
> > org.apache.mahout.clustering.syntheticcontrol.kmeans.Job
>
> Ah, sorry. Didn't realize you were using 0.2. bin/mahout is in trunk.
> We're about to release 0.3, so don't be afraid of trunk.
>
> >
> > And, Grant, could you give me some information on how to use SequenceFile
> > Dumper or class dumper?
> >
>
> They have a main class in them. If you setup the classpath, etc. you can
> use --help to see the options.
>
> -Grant