[jira] [Commented] (MAHOUT-1976) Add Canopy Clustering Algorithm

Hudson (JIRA) Sat, 20 May 2017 21:41:46 -0700

    [ 
https://issues.apache.org/jira/browse/MAHOUT-1976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16018690#comment-16018690
 ]


Hudson commented on MAHOUT-1976:
--------------------------------

SUCCESS: Integrated in Jenkins build Mahout-Quality #3488 (See 
[https://builds.apache.org/job/Mahout-Quality/3488/])
MAHOUT-1976 Canopy Clustering closes apache/mahout#314 (rawkintrevo: rev 
c29496cb11372baddbb76acdee51530347525645)
* (add) 
math-scala/src/main/scala/org/apache/mahout/math/algorithms/clustering/ClusteringModel.scala
* (add) 
flink/src/test/scala/org/apache/mahout/flinkbindings/standard/ClusteringSuite.scala
* (add) 
math-scala/src/main/scala/org/apache/mahout/math/algorithms/common/distance/DistanceMetrics.scala
* (add) website/docs/algorithms/clustering/canopy/SampleData.png
* (add) website/docs/algorithms/clustering/index.md
* (edit) website/docs/_includes/algo_navbar.html
* (add) 
spark/src/test/scala/org/apache/mahout/math/algorithms/ClusteringSuite.scala
* (add) website/docs/algorithms/clustering/canopy/index.md
* (add) 
math-scala/src/test/scala/org/apache/mahout/math/algorithms/ClusteringSuiteBase.scala
* (add) 
h2o/src/test/scala/org/apache/mahout/math/algorithms/ClusteringSuite.scala
* (add) website/docs/algorithms/clustering/canopy/Canopy10.png
* (add) website/docs/algorithms/clustering/canopy/Canopy.png
* (add) 
math-scala/src/main/scala/org/apache/mahout/math/algorithms/clustering/Canopy.scala
* (add) website/docs/algorithms/clustering/distance-metrics.md
* (edit) website/docs/algorithms/map-reduce/clustering/canopy-clustering.md


> Add Canopy Clustering Algorithm
> -------------------------------
>
>                 Key: MAHOUT-1976
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1976
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Algorithms
>    Affects Versions: 0.13.2
>            Reporter: Trevor Grant
>            Assignee: Trevor Grant
>
> Primarily, we need to lay out the clustering section of the Algorihtms 
> Framework.
> The Canopy Clustering Algorithm is very simple and yet very useful as a 
> preprocessing step for more advanced clustering algorithms such as KMeans and 
> Hierarchical Clustering. 
> https://en.wikipedia.org/wiki/Canopy_clustering_algorithm
> The majority of the "work" on this PR will be creating the framework. 
> It is also one of the Legacy MR algorithms that would be nice to port.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

[jira] [Commented] (MAHOUT-1976) Add Canopy Clustering Algorithm

Reply via email to