[scikit-learn] Any way to pre-calculate number of cluster roughly?

2019-06-26 Thread lampahome
I see many ways like elbow method, silhouette score, they all define the cluster number after clustering. Especially the elbow method, I need to monitor the relation with cluster number and find the elbow. But if the dataset is too huge to let me find the elbow and I don't even how many cluster n

Re: [scikit-learn] Any way to pre-calculate number of cluster roughly?

2019-06-26 Thread Jamie Bull
A common rule of thumb is number of clusters = sqrt(number of items/2) http://www.ijarcsms.com/docs/paper/volume1/issue6/V1I6-0015.pdf On Wed, 26 Jun 2019 at 12:32, lampahome wrote: > I see many ways like elbow method, silhouette score, they all define the > cluster number after clustering. > >

Re: [scikit-learn] Any way to pre-calculate number of cluster roughly?

2019-06-26 Thread lampahome
Jamie Bull 於 2019年6月26日 週三 下午11:02寫道: > A common rule of thumb is number of clusters = sqrt(number of items/2) > http://www.ijarcsms.com/docs/paper/volume1/issue6/V1I6-0015.pdf > >> >> If I found it the number is too much, how to merge those groups? Calculate each silhouette score of groups or el