Hi everybody.
I recently opened an issue on renaming the clustering parameters:
https://github.com/scikit-learn/scikit-learn/issues/844

At the moment, the parameter in KMeans and MiniBatchKMeans and 
SpectralClustering is called k,
and n_clusters in ward.
The number of cluster centers in GMM is called n_components.

Olivier suggested "n_components" for all the clustering algorithms, 
while Mathieu suggested
"n_clusters" so to make the destinction between clustering and 
factorization more clear.

I felt this should be discussed with a wider audience.

The current idea would be to use n_clusters for all clustering 
algorithms and n_components
for GMM.

Comments?


Cheers,
Andy


------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to