[ https://issues.apache.org/jira/browse/MAHOUT-1976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16018690#comment-16018690 ]
Hudson commented on MAHOUT-1976: -------------------------------- SUCCESS: Integrated in Jenkins build Mahout-Quality #3488 (See [https://builds.apache.org/job/Mahout-Quality/3488/]) MAHOUT-1976 Canopy Clustering closes apache/mahout#314 (rawkintrevo: rev c29496cb11372baddbb76acdee51530347525645) * (add) math-scala/src/main/scala/org/apache/mahout/math/algorithms/clustering/ClusteringModel.scala * (add) flink/src/test/scala/org/apache/mahout/flinkbindings/standard/ClusteringSuite.scala * (add) math-scala/src/main/scala/org/apache/mahout/math/algorithms/common/distance/DistanceMetrics.scala * (add) website/docs/algorithms/clustering/canopy/SampleData.png * (add) website/docs/algorithms/clustering/index.md * (edit) website/docs/_includes/algo_navbar.html * (add) spark/src/test/scala/org/apache/mahout/math/algorithms/ClusteringSuite.scala * (add) website/docs/algorithms/clustering/canopy/index.md * (add) math-scala/src/test/scala/org/apache/mahout/math/algorithms/ClusteringSuiteBase.scala * (add) h2o/src/test/scala/org/apache/mahout/math/algorithms/ClusteringSuite.scala * (add) website/docs/algorithms/clustering/canopy/Canopy10.png * (add) website/docs/algorithms/clustering/canopy/Canopy.png * (add) math-scala/src/main/scala/org/apache/mahout/math/algorithms/clustering/Canopy.scala * (add) website/docs/algorithms/clustering/distance-metrics.md * (edit) website/docs/algorithms/map-reduce/clustering/canopy-clustering.md > Add Canopy Clustering Algorithm > ------------------------------- > > Key: MAHOUT-1976 > URL: https://issues.apache.org/jira/browse/MAHOUT-1976 > Project: Mahout > Issue Type: Improvement > Components: Algorithms > Affects Versions: 0.13.2 > Reporter: Trevor Grant > Assignee: Trevor Grant > > Primarily, we need to lay out the clustering section of the Algorihtms > Framework. > The Canopy Clustering Algorithm is very simple and yet very useful as a > preprocessing step for more advanced clustering algorithms such as KMeans and > Hierarchical Clustering. > https://en.wikipedia.org/wiki/Canopy_clustering_algorithm > The majority of the "work" on this PR will be creating the framework. > It is also one of the Legacy MR algorithms that would be nice to port. -- This message was sent by Atlassian JIRA (v6.3.15#6346)