[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13232455#comment-13232455 ]
Saikat Kanjilal commented on MAHOUT-984:
----------------------------------------

Paritosh,
I am in the middle of the refactoring and have some questions. I removed clusterDataMR and clusterDataSeq and replaced them with a single clusterData, similar to what you set up for k-means. However, fuzzy k-means has two additional parameters, convergenceDelta and m, and I was wondering how and where to take them into account. Both are accepted by the new clusterData function but are not yet used in its body. The function is shown below:

  public static void clusterData(Path input,
                                 Path clustersIn,
                                 Path output,
                                 DistanceMeasure measure,
                                 double convergenceDelta,
                                 float m,
                                 boolean emitMostLikely,
                                 double threshold,
                                 boolean runSequential)
      throws IOException, ClassNotFoundException, InterruptedException {
    if (log.isInfoEnabled()) {
      log.info("Running Clustering");
      log.info("Input: {} Clusters In: {} Out: {} Distance: {}",
               new Object[] {input, clustersIn, output, measure});
    }
    ClusterClassifier.writePolicy(new FuzzyKMeansClusteringPolicy(), clustersIn);
    ClusterClassificationDriver.run(input, output, new Path(output, CLUSTERED_POINTS_DIRECTORY),
                                    threshold, true, runSequential);
  }

Let me know your thoughts.
Thanks

> Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning
> -----------------------------------------------------------------------------------
>
>                 Key: MAHOUT-984
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-984
>             Project: Mahout
>          Issue Type: Sub-task
>          Components: Clustering
>    Affects Versions: 0.6
>            Reporter: Paritosh Ranjan
>            Assignee: Paritosh Ranjan
>              Labels: clustering
>             Fix For: 0.7
>
>
> Use ClusterClassificationDriver to refactor clustering out of FuzzyKMeansDriver with outlier pruning support.

--
This message is automatically generated by JIRA.
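
For context on the two extra parameters raised in the comment: in the standard fuzzy k-means formulation, m is the fuzziness exponent used when computing a point's membership in each cluster, and convergenceDelta is the threshold for the stopping test between iterations. The sketch below is plain Java, not Mahout code; the class name FuzzyMembershipSketch and both method names are hypothetical and exist only for illustration. It suggests that a classification-only pass would need m (memberships are computed) but would probably not consult convergenceDelta (nothing iterates). Whether and how FuzzyKMeansClusteringPolicy accepts these values is not confirmed by this thread.

```java
import java.util.Arrays;

// Illustrative only: standard fuzzy k-means membership formula, not Mahout's implementation.
public class FuzzyMembershipSketch {

  /**
   * Membership of one point in each cluster, given its distance to every
   * cluster center: u_i = 1 / sum_j (d_i / d_j)^(2 / (m - 1)).
   */
  public static double[] memberships(double[] distances, double m) {
    if (m <= 1.0) {
      throw new IllegalArgumentException("fuzziness m must be greater than 1");
    }
    double[] u = new double[distances.length];
    // A point lying exactly on a center belongs entirely to the first such center.
    for (int i = 0; i < distances.length; i++) {
      if (distances[i] == 0.0) {
        u[i] = 1.0;
        return u;
      }
    }
    double exponent = 2.0 / (m - 1.0);
    for (int i = 0; i < distances.length; i++) {
      double sum = 0.0;
      for (double dj : distances) {
        sum += Math.pow(distances[i] / dj, exponent);
      }
      u[i] = 1.0 / sum;
    }
    return u;
  }

  /** Stopping test used while iterating: has the center moved no more than convergenceDelta? */
  public static boolean converged(double centerShift, double convergenceDelta) {
    return centerShift <= convergenceDelta;
  }

  public static void main(String[] args) {
    // One point at distance 1.0 from the first center and 3.0 from the second.
    double[] u = memberships(new double[] {1.0, 3.0}, 2.0);
    System.out.println(Arrays.toString(u));
  }
}
```

With m = 2 and distances 1.0 and 3.0, the memberships come out at roughly 0.9 and 0.1. Larger values of m flatten the memberships toward equal shares, which is why m would still matter when only classifying points against fixed clusters.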