[ 
https://issues.apache.org/jira/browse/MAHOUT-5?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Eastman updated MAHOUT-5:
------------------------------

    Attachment: MAHOUT-5c.diff

Added generic declarations to make most of the annoying little Eclipse (!) 
marks go away. Modified the canopy and kmeans reducers to output the same 
format (key=identifierString; value=formatString) so that canopy clusters can 
be used directly for input to kmeans. Added a unit test thereof to prove it 
works.

All unit tests run.

This patch should be applied to trunk r629348.

> Implement a k-means clustering prototype 
> -----------------------------------------
>
>                 Key: MAHOUT-5
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-5
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Clustering
>    Affects Versions: 0.1
>            Reporter: Jeff Eastman
>            Assignee: Jeff Eastman
>            Priority: Minor
>         Attachments: kmeans.zip, MAHOUT-5a.diff, MAHOUT-5b.diff, 
> MAHOUT-5c.diff
>
>
> K-means clustering is closely related to Canopy clustering and often uses 
> canopies to determine the initial clusters. I'd like to implement a k-means 
> prototype and tests in the package org.apache.mahout.clustering.kmeans. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to