[
https://issues.apache.org/jira/browse/MAHOUT-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13881984#comment-13881984
]
Andrew Musselman commented on MAHOUT-1410:
------------------------------------------
Here's what I'm getting after running cluster-reuters.sh kmeans with and
without the --namedVector flag.
$ mahout seqdumper -i
/tmp/mahout-work-akm/reuters-kmeans/clusters-2-final/part-r-00000
MAHOUT_LOCAL is set, so we don't add HADOOP_CONF_DIR to classpath.
MAHOUT_LOCAL is set, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in
[jar:file:/home/akm/mahout/examples/target/mahout-examples-1.0-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in
[jar:file:/home/akm/mahout/examples/target/dependency/slf4j-jcl-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.JCLLoggerFactory]
Jan 25, 2014 10:43:35 AM org.slf4j.impl.JCLLoggerAdapter info
INFO: Command line arguments: {--endPhase=[2147483647],
--input=[/tmp/mahout-work-akm/reuters-kmeans/clusters-2-final/part-r-00000],
--startPhase=[0], --tempDir=[temp]}
Input Path: /tmp/mahout-work-akm/reuters-kmeans/clusters-2-final/part-r-00000
Key class: class org.apache.hadoop.io.IntWritable Value Class: class
org.apache.mahout.clustering.iterator.ClusterWritable
Key: 0: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 1: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 2: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 3: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 4: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 5: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 6: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 7: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 8: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 9: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 10: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 11: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 12: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 13: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 14: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 15: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 16: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 17: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 18: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 19: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Count: 20
Jan 25, 2014 10:43:35 AM org.slf4j.impl.JCLLoggerAdapter info
INFO: Program took 757 ms (Minutes: 0.012616666666666667)
> clusteredPoints do not contain a vector id
> ------------------------------------------
>
> Key: MAHOUT-1410
> URL: https://issues.apache.org/jira/browse/MAHOUT-1410
> Project: Mahout
> Issue Type: Bug
> Components: Clustering
> Affects Versions: 0.8
> Environment: using 0.9 release candidate
> Reporter: Pat Ferrel
> Assignee: Suneel Marthi
> Fix For: 0.9
>
> Attachments: MAHOUT-1410.patch, MAHOUT-1410.patch
>
>
> When clustering non-named vectors there are no vector ids in clusteredPoints
> so the other values there, cluster id, vector values, distance-squared, pdf,
> cannot be tied to any known vector.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)