[ 
https://issues.apache.org/jira/browse/MAHOUT-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13881984#comment-13881984
 ] 

Andrew Musselman commented on MAHOUT-1410:
------------------------------------------

Here's what I'm getting after running cluster-reuters.sh kmeans with and 
without the --namedVector flag.

$ mahout seqdumper -i 
/tmp/mahout-work-akm/reuters-kmeans/clusters-2-final/part-r-00000
MAHOUT_LOCAL is set, so we don't add HADOOP_CONF_DIR to classpath.
MAHOUT_LOCAL is set, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in 
[jar:file:/home/akm/mahout/examples/target/mahout-examples-1.0-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in 
[jar:file:/home/akm/mahout/examples/target/dependency/slf4j-jcl-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.JCLLoggerFactory]
Jan 25, 2014 10:43:35 AM org.slf4j.impl.JCLLoggerAdapter info
INFO: Command line arguments: {--endPhase=[2147483647], 
--input=[/tmp/mahout-work-akm/reuters-kmeans/clusters-2-final/part-r-00000], 
--startPhase=[0], --tempDir=[temp]}
Input Path: /tmp/mahout-work-akm/reuters-kmeans/clusters-2-final/part-r-00000
Key class: class org.apache.hadoop.io.IntWritable Value Class: class 
org.apache.mahout.clustering.iterator.ClusterWritable
Key: 0: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 1: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 2: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 3: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 4: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 5: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 6: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 7: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 8: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 9: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 10: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 11: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 12: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 13: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 14: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 15: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 16: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 17: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 18: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Key: 19: Value: org.apache.mahout.clustering.iterator.ClusterWritable@5becf473
Count: 20
Jan 25, 2014 10:43:35 AM org.slf4j.impl.JCLLoggerAdapter info
INFO: Program took 757 ms (Minutes: 0.012616666666666667)


> clusteredPoints do not contain a vector id
> ------------------------------------------
>
>                 Key: MAHOUT-1410
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1410
>             Project: Mahout
>          Issue Type: Bug
>          Components: Clustering
>    Affects Versions: 0.8
>         Environment: using 0.9 release candidate
>            Reporter: Pat Ferrel
>            Assignee: Suneel Marthi
>             Fix For: 0.9
>
>         Attachments: MAHOUT-1410.patch, MAHOUT-1410.patch
>
>
> When clustering non-named vectors there are no vector ids in clusteredPoints 
> so the other values there, cluster id, vector values, distance-squared, pdf, 
> cannot be tied to any known vector.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to