Github user holdenk commented on a diff in the pull request:
https://github.com/apache/spark/pull/20629#discussion_r181471237
--- Diff: python/pyspark/ml/clustering.py ---
@@ -322,7 +323,11 @@ def computeCost(self, dataset):
"""
Return the K-means cost (sum of squared distances of points to
their nearest center)
for this model on the given data.
+
+ ..note:: Deprecated in 2.4.0. It will be removed in 3.0.0. Use
ClusteringEvaluator instead.
"""
+ warnings.warn("Deprecated in 2.4.0. It will be removed in 3.0.0.
Use ClusteringEvaluator"
--- End diff --
If we do go this path we need to file a follow up JIRA to update Python
ClusteringEvaluator.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]