I have generated a cosine distance matrix and would like to apply clustering algorithm to the given matrix. np.shape(distance_matrix)==(14000,14000)
I would like to know which clustering suits better and is there any need to process the data further to get it in the form so that a model can be applied. Also any performance tip as the matrix takes around 3-4 hrs of processing. You can find my code here https://github.com/maxyodedara5/BE_Project/blob/master/main.ipynb Code for READ ONLY PURPOSE. -- Regards
_______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn