Choosing appropriate values for T1 and T2 for canopy clustering

Madhusudan Joshi Tue, 12 Apr 2011 23:23:19 -0700

I am been using kmeans to cluster some data. For initial cluster I used
random seed but that resulted in most of the documents (more than 70%) being
clustered in a single cluster. So I want to use canopy clustering to create
initial clusters but I am having problems selecting suitable values for T1
and T2. Is there any method to calculate the appropriate values depending
upon the number of documents used for clustering? Any help will be
appreciated.


-- 
Everything we hear is an opinion, not a fact.
Everything we see is perspective, not the truth.

Choosing appropriate values for T1 and T2 for canopy clustering

Reply via email to