[
https://issues.apache.org/jira/browse/MAHOUT-3?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jeff Eastman updated MAHOUT-3:
------------------------------
Attachment: MAHOUT-3e.diff
This patch refactors the canopy configuration from the various mappers,
combiners & reducers configure method into a single static configure method on
Canopy. I changed the DistanceMeasure creation from explicit tests to more
generic class instance creation and added a unit test thereof. I also made
DistanceMeasure extend JobConfigurable so that they can be configured.
Configurability will allow me to create a WeightedManhattanDistanceMeasure
outside of the mahout library. It seems making distance measures job
configurable increases their versatility at no cost to the library.
Finally, all unit tests still run. The build.xml test target works for me too.
> Build initial canopy clustering prototype
> -----------------------------------------
>
> Key: MAHOUT-3
> URL: https://issues.apache.org/jira/browse/MAHOUT-3
> Project: Mahout
> Issue Type: New Feature
> Components: Clustering
> Reporter: Jeff Eastman
> Assignee: Grant Ingersoll
> Priority: Minor
> Fix For: 0.1
>
> Attachments: MAHOUT-3.diff, MAHOUT-3a.diff, MAHOUT-3b.diff,
> MAHOUT-3c.diff, MAHOUT-3d.diff, MAHOUT-3e.diff
>
>
> I'd like to reserve some namespace, specifically
> org.apache.mahout.clustering.canopy to use for an initial prototype of canopy
> clustering. I'm going to start with a little unit test to get the basic
> algorithm sorted out, then M/R it.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.