Github user mgaido91 commented on the issue:

    https://github.com/apache/spark/pull/20629
  
    thanks for taking a look at this @MLnick. No, it doesn't, in the sense that 
it returns a different result: this is the sum of the squared euclidean 
distance between a point and the centroid of the cluster it is assigned to, 
while the silhouette metric is the average of the silhouette coefficient. So 
they are completely different formulas.
    
    The semantic is a bit different too. Silhouette measures both cohesion and 
separation of the clusters, while `computeCost` as it is measures only cohesion.
    
    Nonetheless, of course both them can be used to evaluate the result of a 
clustering algorithm, even though the silhouette is much better for this 
purpose.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to