Re: [Scikit-learn-general] problem clustering using PCA and kmeans

2012-09-10 Thread Aliabbas Petiwala
Apart from the above problem, can anyone suggest how to extract cluster information from a dendrogram in scikit? More specifically, I want the clusters to be returned as lists of file names of the documents. Thanks On Sun, Sep 9, 2012 at 4:50 PM, Aliabbas Petiwala wrote: > Thanks Olivier that hel
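
One possible way to get clusters back as lists of file names is to cut the dendrogram into flat clusters and group the names by label. The sketch below is not taken from the thread: it assumes SciPy's `scipy.cluster.hierarchy` (rather than a scikit-learn estimator), Ward linkage, and a dense feature matrix whose rows are in the same order as the file names. The helper `clusters_as_filename_lists`, the toy data, and the file names are hypothetical.

```python
from collections import defaultdict

import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage


def clusters_as_filename_lists(X, filenames, n_clusters):
    """Cut a dendrogram into flat clusters and group file names by cluster."""
    # Assumption: Ward linkage on a dense (n_docs, n_features) array.
    Z = linkage(X, method="ward")
    # "maxclust" cuts the tree so that at most n_clusters flat clusters remain.
    labels = fcluster(Z, t=n_clusters, criterion="maxclust")
    groups = defaultdict(list)
    for name, label in zip(filenames, labels):
        groups[int(label)].append(name)
    return [groups[key] for key in sorted(groups)]


# Toy data standing in for document vectors: two well-separated groups.
rng = np.random.RandomState(0)
X = np.vstack([rng.normal(0, 0.1, (3, 2)), rng.normal(5, 0.1, (3, 2))])
filenames = ["a.txt", "b.txt", "c.txt", "d.txt", "e.txt", "f.txt"]
clusters = clusters_as_filename_lists(X, filenames, n_clusters=2)
```

With real documents, `X` would be whatever feature matrix was used to build the dendrogram, and `n_clusters` could be replaced by a distance threshold via `criterion="distance"`.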

Re: [Scikit-learn-general] problem clustering using PCA and kmeans

2012-09-09 Thread Aliabbas Petiwala
Thanks Olivier, that helped to show me the output, but for the same code as given before I am not getting proper clusters: as shown in the plot below, there are no clearly disparate clusters and the points seem to overlap. But using hierarchical clustering on the same dataset I did find about 7 disparate

Re: [Scikit-learn-general] problem clustering using PCA and kmeans

2012-09-08 Thread Olivier Grisel
2012/9/8 Aliabbas Petiwala : > Hi, > I am trying to cluster a list of text docs based on similarity by first > identifying the clusters using PCA and then proceeding with a kmeans using > the results of PCA as shown below. The problem is that the kmeans does > output the 3 clusters but the plot fun

[Scikit-learn-general] problem clustering using PCA and kmeans

2012-09-08 Thread Aliabbas Petiwala
Hi, I am trying to cluster a list of text docs based on similarity by first identifying the clusters using PCA and then proceeding with a kmeans using the results of PCA as shown below. The problem is that the kmeans does output the 3 clusters but the plot function fails to display the clustering r
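
The poster's code is not visible in this archive excerpt, so the following is only a rough sketch of the pipeline described (text docs, then PCA, then k-means with 3 clusters, then a scatter plot). It assumes TF-IDF features from `TfidfVectorizer`, a corpus small enough to densify for `PCA`, and matplotlib for plotting. The helpers `cluster_docs` and `plot_clusters`, the output file name, and the toy documents are hypothetical.

```python
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.feature_extraction.text import TfidfVectorizer


def cluster_docs(docs, n_clusters=3, n_components=2, seed=0):
    """Return (low-dimensional coordinates, cluster labels) for raw text docs."""
    tfidf = TfidfVectorizer().fit_transform(docs)
    # PCA needs a dense array, so this only suits small corpora.
    coords = PCA(n_components=n_components).fit_transform(tfidf.toarray())
    kmeans = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    labels = kmeans.fit_predict(coords)
    return coords, labels


def plot_clusters(coords, labels, path="clusters.png"):
    """Scatter the first two components, coloured by cluster, and save to a file."""
    import matplotlib
    matplotlib.use("Agg")  # write to a file; no display required
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    ax.scatter(coords[:, 0], coords[:, 1], c=labels)
    ax.set_xlabel("PC 1")
    ax.set_ylabel("PC 2")
    fig.savefig(path)
    plt.close(fig)


# Toy corpus standing in for the list of text docs.
docs = [
    "cats and dogs are common pets",
    "dogs chase cats around the house",
    "stocks and bonds are traded on markets",
    "markets fell as stocks dropped sharply",
    "the football team won the match",
    "the team lost the final football match",
]
coords, labels = cluster_docs(docs)
```

Calling `plot_clusters(coords, labels)` then writes the scatter plot to `clusters.png`. Saving to a file sidesteps the case where a plot window never appears, which may or may not be the failure described above.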