I have generated a cosine distance matrix and would like to apply
clustering algorithm to the given matrix.
I would like to know which clustering suits better and is there any need to
process the data further to get it in the form so that a model can be
Also any performance tip as the matrix takes around 3-4 hrs of processing.
You can find my code here
Code for READ ONLY PURPOSE.
scikit-learn mailing list