Yep work perfectly thanks for the help!

David Stuart

On 8 May 2010, at 10:24, Robin Anil <robin.a...@gmail.com> wrote:

oops. I figured it out. Please specify a -k (number of cluster)
parameter and the distance threshold. :) KMeans need to know either
the cluster count or the clusters in the -c clusters folder. If it
doesn't find k then it assumes you have initial clusters put in the
clusters folder.

PS: you can simply do a "bin/mahout kmeans" to run kmeans Since Mahout 0.3

Robin

On Sat, May 8, 2010 at 2:47 PM, david.stu...@progressivealliance.co.uk
<david.stu...@progressivealliance.co.uk> wrote:
Hi Robin,

I'm using the latest from trunk so 0.4

Seq file dump

bin/mahout org.apache.mahout.utils.SequenceFileDumper --seqFile / tmp/out.txt
Input Path: /tmp/out.txt
Key class: class org.apache.hadoop.io.LongWritable Value Class: class
org.apache.mahout.math.VectorWritable
Key: 0: Value: org.apache.mahout.math.vectorwrita...@6bffc686
Key: 1: Value: org.apache.mahout.math.vectorwrita...@6bffc686
Key: 2: Value: org.apache.mahout.math.vectorwrita...@6bffc686
Key: 3: Value: org.apache.mahout.math.vectorwrita...@6bffc686
Key: 4: Value: org.apache.mahout.math.vectorwrita...@6bffc686
Key: 5: Value: org.apache.mahout.math.vectorwrita...@6bffc686
Key: 6: Value: org.apache.mahout.math.vectorwrita...@6bffc686
Key: 7: Value: org.apache.mahout.math.vectorwrita...@6bffc686



On 08 May 2010 at 10:57 Robin Anil <robin.a...@gmail.com> wrote:

David, couple of things needed to debug this
1) Tell me which version of Mahout are you using.
2) use o.a.m.utils.SequenceFileDumper to dump the out.txt and see what
the key and value classes are

Robin

Reply via email to