[ 
https://issues.apache.org/jira/browse/MAHOUT-3?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Eastman updated MAHOUT-3:
------------------------------

    Attachment: MAHOUT-3e.diff

This patch moves the canopy configuration out of the configure methods of the 
various mappers, combiners, and reducers into a single static configure method 
on Canopy. I changed the DistanceMeasure creation from explicit tests to more 
generic class instance creation and added a unit test for it. I also made 
DistanceMeasure extend JobConfigurable so that distance measures can be 
configured. Configurability will allow me to create a 
WeightedManhattanDistanceMeasure outside of the Mahout library. Making 
distance measures job configurable seems to increase their versatility at no 
cost to the library.
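
As a rough illustration of the generic instance creation and configurability 
described above, here is a minimal, self-contained sketch. It is not the code 
in the attached diff. To keep it runnable without Hadoop, a plain Map stands 
in for JobConf and a local Configurable interface stands in for 
JobConfigurable. The property names, the createMeasure helper, and the body of 
WeightedManhattanDistanceMeasure are all assumptions made up for this example.

```java
import java.util.HashMap;
import java.util.Map;

public class DistanceMeasureSketch {

  /** Stand-in for Hadoop's JobConfigurable (assumption: a Map replaces JobConf). */
  public interface Configurable {
    void configure(Map<String, String> conf);
  }

  /** Simplified distance measure that can be configured from the job settings. */
  public interface DistanceMeasure extends Configurable {
    double distance(double[] a, double[] b);
  }

  /** Hypothetical measure defined outside the library; weights come from the config. */
  public static class WeightedManhattanDistanceMeasure implements DistanceMeasure {
    private double[] weights = new double[0];

    @Override
    public void configure(Map<String, String> conf) {
      // "sketch.distance.weights" is a made-up property name for this example.
      String csv = conf.get("sketch.distance.weights");
      if (csv == null || csv.trim().isEmpty()) {
        return;
      }
      String[] parts = csv.split(",");
      weights = new double[parts.length];
      for (int i = 0; i < parts.length; i++) {
        weights[i] = Double.parseDouble(parts[i].trim());
      }
    }

    @Override
    public double distance(double[] a, double[] b) {
      double sum = 0.0;
      for (int i = 0; i < a.length; i++) {
        // Dimensions without a configured weight default to 1.0.
        double w = i < weights.length ? weights[i] : 1.0;
        sum += w * Math.abs(a[i] - b[i]);
      }
      return sum;
    }
  }

  /**
   * Creates the measure named in the configuration by reflection instead of
   * testing for each known class name, then lets it configure itself.
   */
  public static DistanceMeasure createMeasure(Map<String, String> conf) {
    // "sketch.distance.measure.class" is a made-up property name as well.
    String className = conf.get("sketch.distance.measure.class");
    try {
      DistanceMeasure measure = Class.forName(className)
          .asSubclass(DistanceMeasure.class)
          .getDeclaredConstructor()
          .newInstance();
      measure.configure(conf);
      return measure;
    } catch (ReflectiveOperationException e) {
      throw new IllegalArgumentException("Cannot create distance measure: " + className, e);
    }
  }

  public static void main(String[] args) {
    Map<String, String> conf = new HashMap<>();
    conf.put("sketch.distance.measure.class",
        WeightedManhattanDistanceMeasure.class.getName());
    conf.put("sketch.distance.weights", "2.0,0.5");

    DistanceMeasure measure = createMeasure(conf);
    // 2.0 * |0 - 1| + 0.5 * |0 - 2| = 3.0
    System.out.println(measure.distance(new double[] {0, 0}, new double[] {1, 2}));
  }
}
```

Because the factory only needs a class name and a no-argument constructor, a 
measure like this can live in user code and still be picked up by the job, 
which is the versatility argued for above.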

Finally, all unit tests still run. The build.xml test target works for me too.

> Build initial canopy clustering prototype
> -----------------------------------------
>
>                 Key: MAHOUT-3
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-3
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Clustering
>            Reporter: Jeff Eastman
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.1
>
>         Attachments: MAHOUT-3.diff, MAHOUT-3a.diff, MAHOUT-3b.diff, 
> MAHOUT-3c.diff, MAHOUT-3d.diff, MAHOUT-3e.diff
>
>
> I'd like to reserve some namespace, specifically 
> org.apache.mahout.clustering.canopy to use for an initial prototype of canopy 
> clustering. I'm going to start with a little unit test to get the basic 
> algorithm sorted out, then M/R it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
