[
https://issues.apache.org/jira/browse/MAHOUT-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13662930#comment-13662930
]
Hudson commented on MAHOUT-1223:
--------------------------------
Integrated in Mahout-Quality #2009 (See
[https://builds.apache.org/job/Mahout-Quality/2009/])
MAHOUT-1223: Fixed point being skipped in StreamingKMeans when iterating
through centroids from a reducer
When calling StreamingKMeans in the reducer (to collapse the number of clusters
to they can fit into memory), the clustering is done on the Hadoop reducer
iterable.
Currently, the first Centroid is added directly as a special case and then is
skipped when iterating through the main loop.
However, Hadoop reducer iterables cannot be rewound therefore causing SKM to
skip one point. (Revision 1484747)
Result = SUCCESS
dfilimon :
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1484747
Files :
* /mahout/trunk/CHANGELOG
*
/mahout/trunk/core/src/main/java/org/apache/mahout/clustering/streaming/cluster/StreamingKMeans.java
> Point skipped in StreamingKMeans when iterating through centroids from a
> reducer
> --------------------------------------------------------------------------------
>
> Key: MAHOUT-1223
> URL: https://issues.apache.org/jira/browse/MAHOUT-1223
> Project: Mahout
> Issue Type: Bug
> Components: Clustering
> Affects Versions: 0.8
> Reporter: Dan Filimon
> Priority: Minor
> Fix For: 0.8
>
>
> When calling StreamingKMeans in the reducer (to collapse the number of
> clusters to they can fit into memory), the clustering is done on the Hadoop
> reducer iterable.
> Currently, the first Centroid is added directly as a special case and then is
> skipped when iterating through the main loop.
> However, Hadoop reducer iterables cannot be rewound therefore causing SKM to
> skip one point.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira