I have generated a cosine distance matrix and would like to apply a
clustering algorithm to it. The matrix is square:
np.shape(distance_matrix) == (14000, 14000)

I would like to know which clustering algorithm would suit this matrix
best, and whether the data needs any further processing before a model can
be applied.
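
To make the question concrete, here is a minimal sketch of what applying a model to a precomputed matrix could look like. It is not taken from the notebook: the toy `features` array, the choice of DBSCAN as one candidate, and the `eps` and `min_samples` values are all assumptions for illustration only.

```python
import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.metrics.pairwise import cosine_distances

# Toy stand-in for the real data (assumption): two groups of vectors
# pointing in clearly different directions.
rng = np.random.default_rng(0)
group_a = rng.normal(loc=[5.0, 0.0, 0.0], scale=0.1, size=(20, 3))
group_b = rng.normal(loc=[0.0, 5.0, 0.0], scale=0.1, size=(20, 3))
features = np.vstack([group_a, group_b])

# Square, symmetric matrix of pairwise cosine distances, shape (40, 40).
distance_matrix = cosine_distances(features)

# metric="precomputed" tells DBSCAN that the input is already a distance
# matrix rather than a feature matrix. eps and min_samples are arbitrary
# values chosen for this toy data, not recommendations.
model = DBSCAN(eps=0.05, min_samples=5, metric="precomputed")
labels = model.fit_predict(distance_matrix)

# One label per row of the matrix; -1 would mark points treated as noise.
print(labels)
```

With the real matrix, `distance_matrix` would simply be the existing (14000, 14000) array passed to `fit_predict` unchanged.
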
Any performance tips would also be welcome, as the matrix takes around 3-4
hours to process.
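
If most of that time is spent building the matrix itself, a vectorized computation is one thing to compare against. The sketch below assumes the inputs are the rows of a dense numeric array. The names `features` and `cosine_distance_matrix` are hypothetical and do not come from the notebook.

```python
import numpy as np


def cosine_distance_matrix(features):
    """Return the pairwise cosine distance matrix for the rows of `features`."""
    features = np.asarray(features, dtype=np.float64)
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    # Leave all-zero rows as they are instead of dividing by zero.
    unit = features / np.where(norms == 0.0, 1.0, norms)
    # A single matrix product gives every pairwise cosine similarity.
    distances = 1.0 - unit @ unit.T
    # Remove tiny floating-point excursions outside the valid range [0, 2].
    np.clip(distances, 0.0, 2.0, out=distances)
    np.fill_diagonal(distances, 0.0)
    return distances


# Random stand-in data (assumption): 200 samples with 50 features each.
rng = np.random.default_rng(0)
features = rng.random((200, 50))
distance_matrix = cosine_distance_matrix(features)
print(distance_matrix.shape)
```

For 14000 rows the result holds 14000 * 14000 float64 values, roughly 1.6 GB, so memory may matter as much as run time.
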
You can find my code here:
https://github.com/maxyodedara5/BE_Project/blob/master/main.ipynb
The code is shared for read-only purposes.
-- 
Regards
_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn
