GitHub user BryanCutler commented on a diff in the pull request:
https://github.com/apache/spark/pull/21557#discussion_r198675445
--- Diff: python/pyspark/ml/clustering.py ---
@@ -622,10 +621,10 @@ def __init__(self, featuresCol="features", predictionCol="prediction", maxIter=2
     @keyword_only
     @since("2.0.0")
     def setParams(self, featuresCol="features", predictionCol="prediction", maxIter=20,
-                  seed=None, k=4, minDivisibleClusterSize=1.0):
+                  seed=None, k=4, minDivisibleClusterSize=1.0, distanceMeasure="euclidean"):
         """
         setParams(self, featuresCol="features", predictionCol="prediction", maxIter=20, \
-                  seed=None, k=4, minDivisibleClusterSize=1.0)
+                  seed=None, k=4, minDivisibleClusterSize=1.0, distanceMeasure="euclidean")
         Sets params for BisectingKMeans.
--- End diff --
I know we already get `setDistanceMeasure` and `getDistanceMeasure`
from the shared param, but can you also define them here so that they can carry
the `since` decorator? (Same as in KMeans.)
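
A rough sketch of what this could look like, written so that it runs without Spark installed. Everything here is a stand-in: `since` is a simplified imitation of the pyspark decorator (assumed to append a `versionadded` note to the docstring), `HasDistanceMeasureSketch` and `BisectingKMeansSketch` are hypothetical names, and the version string `"2.4.0"` is only a placeholder for whichever release the param lands in.

```python
def since(version):
    """Simplified stand-in for pyspark's `since` decorator (assumption:
    it records the version by appending a note to the docstring)."""
    def deco(f):
        f.__doc__ = (f.__doc__ or "").rstrip() + "\n\n.. versionadded:: %s" % version
        return f
    return deco


class HasDistanceMeasureSketch(object):
    """Hypothetical stand-in for the shared-param mixin, which already
    provides an undecorated setter and getter."""

    def __init__(self):
        self._params = {"distanceMeasure": "euclidean"}

    def setDistanceMeasure(self, value):
        self._params["distanceMeasure"] = value
        return self

    def getDistanceMeasure(self):
        return self._params["distanceMeasure"]


class BisectingKMeansSketch(HasDistanceMeasureSketch):
    """Hypothetical estimator that redefines the accessors only so that
    they can be tagged with a version."""

    @since("2.4.0")  # placeholder version, not taken from the PR
    def setDistanceMeasure(self, value):
        """Sets the value of distanceMeasure."""
        return super(BisectingKMeansSketch, self).setDistanceMeasure(value)

    @since("2.4.0")  # placeholder version, not taken from the PR
    def getDistanceMeasure(self):
        """Gets the value of distanceMeasure or its default value."""
        return super(BisectingKMeansSketch, self).getDistanceMeasure()
```

The behaviour is unchanged from the inherited methods; the only difference is that the overriding definitions give the decorator something to attach to, so the generated docs can show when the param became available on this class.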
---