[ 
https://issues.apache.org/jira/browse/MAHOUT-1206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13672489#comment-13672489
 ] 

Ted Dunning commented on MAHOUT-1206:
-------------------------------------

I put some questions on here.

I am still somewhat dubious of these algorithms for large data.

If you could suggest more concrete information about how you would implement 
them, that would help.

The other questions about need and support are important as well.
                
> Add density-based clustering algorithms to mahout
> -------------------------------------------------
>
>                 Key: MAHOUT-1206
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1206
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Yexi Jiang
>              Labels: clustering
>             Fix For: Backlog
>
>
> The clustering algorithms (kmeans, fuzzy kmeans, dirichlet clustering, and 
> spectral cluster) clustering data by assuming that the data can be clustered 
> into the regular hyper sphere or ellipsoid. However, in practical, not all 
> the data can be clustered in this way. 
> To enable the data to be clustered in arbitrary shapes, clustering algorithms 
> like DBSCAN, BIRCH, CLARANCE 
> (http://en.wikipedia.org/wiki/Cluster_analysis#Density-based_clustering) are 
> proposed.
> It is better that we can implement one or some of these clustering algorithm 
> to enrich the clustering library. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to