Hi,

In the silhouette example (http://scikit-learn.org/stable/auto_examples/cluster/plot_kmeans_silhouette_analysis.html#sphx-glr-auto-examples-cluster-plot-kmeans-silhouette-analysis-py), the silhouette values of each sample are computed twice: once with *silhouette_score* and once with *silhouette_samples*. The call to *silhouette_score* can easily be avoided by computing the mean of the result of *silhouette_samples*.
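To make the suggestion concrete, here is a minimal sketch of the idea. It is not the example's actual code: the toy data, sizes, seeds, and variable names are assumptions for illustration, and the equivalence relies on *silhouette_score* being called with its default arguments (no *sample_size* subsampling).

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_samples, silhouette_score

# Toy data; the sizes and seeds below are illustrative assumptions.
X, _ = make_blobs(n_samples=200, n_features=2, centers=4, random_state=1)
cluster_labels = KMeans(n_clusters=4, n_init=10, random_state=10).fit_predict(X)

# Compute the per-sample silhouette values once...
sample_silhouette_values = silhouette_samples(X, cluster_labels)

# ...and derive the average from them instead of calling silhouette_score.
silhouette_avg = sample_silhouette_values.mean()

# Sanity check only: with default arguments both routes agree.
assert np.isclose(silhouette_avg, silhouette_score(X, cluster_labels))
```

The final assertion is there only to check the equivalence; in the example itself it would be dropped, so the pairwise distances are computed a single time.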
Do you think we should remove the call to *silhouette_score* to improve performance? Or is it better to keep both functions to show how to use them?

Anaël Bonneton
_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn