Dan Filimon created MAHOUT-1224:
-----------------------------------
Summary: Add the option of running a StreamingKMeans pass in the
Reducer before BallKMeans
Key: MAHOUT-1224
URL: https://issues.apache.org/jira/browse/MAHOUT-1224
Project: Mahout
Issue Type: New Feature
Components: Clustering
Affects Versions: 0.8
Reporter: Dan Filimon
Sometimes, the number of points passed to the reducer from the mappers in the
StreamingKMeansDriver job is too large to fit into memory.
In that case, applying another StreamingKMeans pass can collapse the mapper
intermediate clusters to a more manageable size to be clustered.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira