Re: [scikit-learn] Construct the microclusters using a CF-Tree

2017-07-06 Thread Roman Yurchak
Hello Sema, On 05/07/17 13:27, Sema Atasever wrote: How can i know which cluster member represents best each cluster? You could try to pick the one that's closest to the cluster centroid.. In the birch code i use this code line: *centroids = brc.subcluster_centers_* How do I interpret this l

Re: [scikit-learn] Construct the microclusters using a CF-Tree

2017-07-03 Thread Roman Yurchak
Hello Sema, as far as I can tell, in your dataset you has n_samples=65909, n_features=539. Clustering high dimensional data is problematic for a number of reasons, https://en.wikipedia.org/wiki/Clustering_high-dimensional_data#Problems besides the BIRCH implementation doesn't scale well for

Re: [scikit-learn] Construct the microclusters using a CF-Tree

2017-07-03 Thread Sema Atasever
Dear Roman, When I try the code with the original data (*data.dat*) as you suggested, I get the following error : *Memory Error* --> (*error.png*), how can i overcome this problem, thank you so much in advance. ​ data.dat

Re: [scikit-learn] Construct the microclusters using a CF-Tree

2017-06-30 Thread Roman Yurchak
Hello Sema, On 30/06/17 17:14, Sema Atasever wrote: I want to cluster them using Birch clustering algorithm. Does this method have 'precomputed' option. No it doesn't, see http://scikit-learn.org/stable/modules/generated/sklearn.cluster.Birch.html so you would need to provide it with the ori

[scikit-learn] Construct the microclusters using a CF-Tree

2017-06-30 Thread Sema Atasever
Hi all, I want to ask you about clustering usign Birch clustering algorithm. I have a *distance matrix* n*n M where M_ij is the distance between object_i and object_j.(You can see file format in the attachment). I want to cluster them using Birch clustering algorithm. Does this method have 'preco