[ https://issues.apache.org/jira/browse/MAHOUT-931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13176239#comment-13176239 ]
Jeff Eastman commented on MAHOUT-931:
-------------------------------------

Renaming existing entities may be appropriate, but that ought to be done as a separate, independent and agreed-upon change. Otherwise we do not have a consistent vocabulary to discuss the functionality issues. Can we hold off on renaming until we get a bit more of the semantics defined?

I tend to agree that implementing a set of algorithm-specific clustering policy objects will enable many (not all) of the current implementations to be re-implemented with the ClusterClassifier/Iterator. I think we will need to preserve the existing driver classes, which support CLI argument selection in their run() methods, but the buildClusters methods would be revamped to use the new implementation. It does seem like these policy objects need to encapsulate the relevant CLI arguments so we are in sync there. The clusterPoints methods can also be re-implemented using the new clustering postprocessor in MAHOUT-929.

> Implement a pluggable outlier removal capability for cluster classifiers
> ------------------------------------------------------------------------
>
>                 Key: MAHOUT-931
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-931
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification, Clustering
>    Affects Versions: 0.6
>            Reporter: Paritosh Ranjan
>             Fix For: 0.7
>
>         Attachments: MAHOUT-931
>
>
> A pluggable outlier removal capability while classifying the clusters is
> needed. The classification and outlier removal implementations, both should
> be completely separate entities for better abstraction.
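
To make the shape of the proposal in the comment above concrete, here is a minimal sketch of how an algorithm-specific policy object and a separate, pluggable outlier filter might fit together. Every type and method name below (OutlierFilter, ClusteringPolicySketch, PointAssigner, fromArgs, the "convergenceDelta" key) is hypothetical and chosen only for illustration; none of it is taken from the Mahout code base or from the attached patch, and the real design may well differ.

```java
import java.util.Map;

// Hypothetical sketch only: none of these types are Mahout classes.

/** Decides whether a point's best cluster membership is too weak to keep. */
interface OutlierFilter {
  boolean isOutlier(double bestMembership);
}

/** Default filter: keeps every point (no outlier removal). */
final class NoOutlierFilter implements OutlierFilter {
  @Override
  public boolean isOutlier(double bestMembership) {
    return false;
  }
}

/** Flags a point as an outlier when its best membership is below a threshold. */
final class ThresholdOutlierFilter implements OutlierFilter {
  private final double threshold;

  ThresholdOutlierFilter(double threshold) {
    this.threshold = threshold;
  }

  @Override
  public boolean isOutlier(double bestMembership) {
    return bestMembership < threshold;
  }
}

/** Algorithm-specific policy: picks a cluster index from per-cluster memberships. */
interface ClusteringPolicySketch {
  int select(double[] memberships);
}

/** Picks the most likely cluster and carries its own CLI-derived setting. */
final class MostLikelyPolicySketch implements ClusteringPolicySketch {
  private final double convergenceDelta;

  MostLikelyPolicySketch(double convergenceDelta) {
    this.convergenceDelta = convergenceDelta;
  }

  /** Builds the policy from parsed CLI arguments (the key name is an assumption). */
  static MostLikelyPolicySketch fromArgs(Map<String, String> args) {
    return new MostLikelyPolicySketch(
        Double.parseDouble(args.getOrDefault("convergenceDelta", "0.001")));
  }

  double convergenceDelta() {
    return convergenceDelta;
  }

  @Override
  public int select(double[] memberships) {
    if (memberships.length == 0) {
      throw new IllegalArgumentException("no clusters to choose from");
    }
    int best = 0;
    for (int i = 1; i < memberships.length; i++) {
      if (memberships[i] > memberships[best]) {
        best = i;
      }
    }
    return best;
  }
}

/** Combines a policy with an outlier filter; the two stay independent. */
final class PointAssigner {
  static final int OUTLIER = -1;

  private final ClusteringPolicySketch policy;
  private final OutlierFilter filter;

  PointAssigner(ClusteringPolicySketch policy, OutlierFilter filter) {
    this.policy = policy;
    this.filter = filter;
  }

  /** Returns the chosen cluster index, or OUTLIER if the filter rejects the point. */
  int assign(double[] memberships) {
    int best = policy.select(memberships);
    return filter.isOutlier(memberships[best]) ? OUTLIER : best;
  }
}
```

In this sketch a driver's run() method would parse the CLI arguments as it does today, hand them to fromArgs, and pass the resulting policy plus whichever filter was requested to a PointAssigner. Swapping ThresholdOutlierFilter for NoOutlierFilter changes outlier handling without touching the classification logic, which is the separation the issue description asks for.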