Clusterdumper - Get rid of map based implementation
---------------------------------------------------
Key: MAHOUT-940
URL: https://issues.apache.org/jira/browse/MAHOUT-940
Project: Mahout
Issue Type: Improvement
Components: Clustering
Affects Versions: 0.6
Reporter: Paritosh Ranjan
Fix For: 0.7
Current implementation of ClusterDumper puts clusters and related vectors in
map. This generally results in OOM.
Since ClusterOutputProcessor is availabale now. The ClusterDumper will at first
process the clusteredPoints, and then write down the clusters to a local file.
The inability to properly read the clustering output due to ClusterDumper
facing OOM is seen too often in the mailing list. This improvement will fix
that problem.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira