[ https://issues.apache.org/jira/browse/MAHOUT-99?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rohini Uppuluri updated MAHOUT-99:
----------------------------------

    Attachment: MAHOUT-99-1.patch

Hi Grant,

I have made them optional arguments. If they are not given as input, they are set to reasonable defaults. I will be uploading the updated patch reflecting the change. This setting is already configured in Hadoop, but exposing it gives us the flexibility to change it in case we want to increase the number of map tasks.

Thanks,
-Rohini

> Improving speed of KMeans
> -------------------------
>
>                 Key: MAHOUT-99
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-99
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Clustering
>            Reporter: Pallavi Palleti
>            Assignee: Grant Ingersoll
>         Attachments: MAHOUT-99-1.patch, MAHOUT-99.patch
>
>
> Improved the speed of KMeans by passing only the cluster ID from mapper to
> reducer. Previously, the whole cluster info was sent as a formatted string.
> Also removed the implicit assumption that the combiner runs exactly once; the
> code is modified accordingly so that it does not create a bug when the
> combiner runs zero times or more than once.
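
For readers following the issue, here is a minimal sketch of why the combiner assumption matters. It is not the code from the MAHOUT-99 patch and does not use the Hadoop API; the class and method names (KMeansCombinerSketch, Partial, map, combine, reduce) are hypothetical and chosen only for illustration. The idea it assumes is that the mapper keys each point by cluster ID alone and that the value is a (count, sum) partial whose merge is associative, so the final centroid is the same whether the combine step runs zero times, once, or several times.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Illustrative only: plain Java collections stand in for the MapReduce framework.
public class KMeansCombinerSketch {

    /** Partial result for one cluster: number of points seen and their coordinate-wise sum. */
    static final class Partial {
        final long count;
        final double[] sum;

        Partial(long count, double[] sum) {
            this.count = count;
            this.sum = sum;
        }

        /** Associative merge, which is what makes repeated combining safe. */
        Partial merge(Partial other) {
            double[] merged = sum.clone();
            for (int i = 0; i < merged.length; i++) {
                merged[i] += other.sum[i];
            }
            return new Partial(count + other.count, merged);
        }
    }

    /** Index of the center closest to the point (squared Euclidean distance). */
    static int nearest(double[] point, double[][] centers) {
        int best = 0;
        double bestDist = Double.MAX_VALUE;
        for (int c = 0; c < centers.length; c++) {
            double d = 0.0;
            for (int i = 0; i < point.length; i++) {
                double diff = point[i] - centers[c][i];
                d += diff * diff;
            }
            if (d < bestDist) {
                bestDist = d;
                best = c;
            }
        }
        return best;
    }

    /** Folds a non-empty list of partials into one. */
    static Partial mergeAll(List<Partial> partials) {
        Partial acc = partials.get(0);
        for (Partial p : partials.subList(1, partials.size())) {
            acc = acc.merge(p);
        }
        return acc;
    }

    /** Map step: the key is only the cluster ID, the value is a partial with count 1. */
    static Map<Integer, List<Partial>> map(double[][] points, double[][] centers) {
        Map<Integer, List<Partial>> out = new TreeMap<>();
        for (double[] p : points) {
            out.computeIfAbsent(nearest(p, centers), k -> new ArrayList<>())
               .add(new Partial(1, p.clone()));
        }
        return out;
    }

    /** Combine step: same input and output type as the map output, so it can be skipped or repeated. */
    static Map<Integer, List<Partial>> combine(Map<Integer, List<Partial>> in) {
        Map<Integer, List<Partial>> out = new TreeMap<>();
        for (Map.Entry<Integer, List<Partial>> e : in.entrySet()) {
            List<Partial> single = new ArrayList<>();
            single.add(mergeAll(e.getValue()));
            out.put(e.getKey(), single);
        }
        return out;
    }

    /** Reduce step: merges whatever partials arrive and only then divides by the count. */
    static Map<Integer, double[]> reduce(Map<Integer, List<Partial>> in) {
        Map<Integer, double[]> out = new TreeMap<>();
        for (Map.Entry<Integer, List<Partial>> e : in.entrySet()) {
            Partial acc = mergeAll(e.getValue());
            double[] centroid = new double[acc.sum.length];
            for (int i = 0; i < centroid.length; i++) {
                centroid[i] = acc.sum[i] / acc.count;
            }
            out.put(e.getKey(), centroid);
        }
        return out;
    }

    public static void main(String[] args) {
        double[][] centers = {{0.0, 0.0}, {10.0, 10.0}};
        double[][] points = {{1.0, 1.0}, {3.0, 1.0}, {9.0, 11.0}, {11.0, 9.0}};
        Map<Integer, List<Partial>> mapped = map(points, centers);

        // Combiner applied zero times and twice: the centroids should agree.
        Map<Integer, double[]> zero = reduce(mapped);
        Map<Integer, double[]> twice = reduce(combine(combine(mapped)));
        for (Integer id : zero.keySet()) {
            System.out.println(id + " " + Arrays.toString(zero.get(id))
                    + " " + Arrays.toString(twice.get(id)));
        }
    }
}
```

The division by the count happens only in the reduce step. If the combine step emitted averages instead of (count, sum) partials, running it a different number of times could change the result, which is the kind of bug the issue description refers to.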