[ https://issues.apache.org/jira/browse/MAHOUT-931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13176331#comment-13176331 ]
Jeff Eastman commented on MAHOUT-931: ------------------------------------- 1. I don't see a reason to introduce ClusterConfigs yet. I believe the various CLI arguments can be carried in the appropriate ClusteringPolicy implementations. 2. Other than augmenting what exist already with some more CLI arguments, I think this is done 3. Outlier removal is not a part of the buildClusters step, rather the clusterPoints step. I thought you were going to work on those stories while I finish up the mapreduce implementation of buildClusters using ClusterIterator/Classifier/Policies (MAHOUT-933)? This story (MAHOUT-931) should follow after -929 & -930, IMHO, for example: - 929 implement a new post processor that does only classification as required by the various clusterPoints steps. - 930 modify the existing drivers to use this post processor rather than their current, custom implementations. - 931 modify the post processor to support pluggable outlier removal. 4. This can be done once -933 is complete. In any case, this is all post-0.6 stuff. Let's leave trunk where it is with the renaming for now. > Implement a pluggable outlier removal capability for cluster classifiers > ------------------------------------------------------------------------ > > Key: MAHOUT-931 > URL: https://issues.apache.org/jira/browse/MAHOUT-931 > Project: Mahout > Issue Type: Improvement > Components: Classification, Clustering > Affects Versions: 0.6 > Reporter: Paritosh Ranjan > Fix For: 0.7 > > Attachments: MAHOUT-931 > > > A pluggable outlier removal capability while classifying the clusters is > needed. The classification and outlier removal implementations, both should > be completely separate entities for better abstraction. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira