Github user mgaido91 commented on the issue:
https://github.com/apache/spark/pull/20629
thanks for taking a look at this @MLnick. No, it doesn't, in the sense that
it returns a different result: this is the sum of the squared euclidean
distance between a point and the centroid of the cluster it is assigned to,
while the silhouette metric is the average of the silhouette coefficient. So
they are completely different formulas.
The semantic is a bit different too. Silhouette measures both cohesion and
separation of the clusters, while `computeCost` as it is measures only cohesion.
Nonetheless, of course both them can be used to evaluate the result of a
clustering algorithm, even though the silhouette is much better for this
purpose.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]