Adam Bozanich created MAHOUT-1157:
-------------------------------------
Summary: AbstractCluster.formatVector iteration bug.
Key: MAHOUT-1157
URL: https://issues.apache.org/jira/browse/MAHOUT-1157
Project: Mahout
Issue Type: Bug
Components: Clustering
Affects Versions: 0.7
Reporter: Adam Bozanich
AbstractCluster.formatVector's use of the size field of the given vector causes
problems when the vector is sparse.
When reading WeightedVectorWriteables from the clusteredPoints directory that
was created by running kmeans with the -cl flag, the embedded
RandomAccessSparseVector is being instantiated with
I clustered a handful of vectors which had been initialized with a cardinality
of Integer.MAX_VALUE. Running seqdump on the resulting clusteredPoints took
over four minutes.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira