GitHub user BryanCutler commented on a diff in the pull request:
https://github.com/apache/spark/pull/19122#discussion_r138151063
--- Diff: python/pyspark/ml/tuning.py ---
@@ -193,7 +194,8 @@ class CrossValidator(Estimator, ValidatorParams, MLReadable, MLWritable):
>>> lr = LogisticRegression()
>>> grid = ParamGridBuilder().addGrid(lr.maxIter, [0, 1]).build()
>>> evaluator = BinaryClassificationEvaluator()
- >>> cv = CrossValidator(estimator=lr, estimatorParamMaps=grid, evaluator=evaluator)
+ >>> cv = CrossValidator(estimator=lr, estimatorParamMaps=grid, evaluator=evaluator,
+ ... parallelism=2)
--- End diff --
Are you planning to add a unit test verifying that a parallel run
(`parallelism` > 1) produces the same results as a serial run?
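
The shape of such a test could look like the sketch below. It is a standard-library-only illustration, not the PR's code: `fit_and_score`, `avg_metrics`, and `check_parallel_matches_serial` are hypothetical names, and the toy metric stands in for fitting a model and evaluating it. It assumes the parallel path uses a thread pool whose `map` preserves input order, so metrics can be compared position by position.

```python
from multiprocessing.pool import ThreadPool


def fit_and_score(max_iter):
    """Hypothetical stand-in for fitting one param setting and evaluating it."""
    # Deterministic toy "metric": a partial sum of a convergent series.
    return sum(1.0 / (k * k) for k in range(1, max_iter + 2))


def avg_metrics(param_grid, parallelism=1):
    """Score every param setting; use a thread pool when parallelism > 1."""
    if parallelism <= 1:
        return [fit_and_score(p) for p in param_grid]
    pool = ThreadPool(processes=parallelism)
    try:
        # ThreadPool.map preserves input order, so results line up with param_grid.
        return pool.map(fit_and_score, param_grid)
    finally:
        pool.close()
        pool.join()


def check_parallel_matches_serial():
    """Run the same grid serially and in parallel and compare the metrics."""
    grid = [0, 1, 5, 10]
    serial = avg_metrics(grid, parallelism=1)
    parallel = avg_metrics(grid, parallelism=2)
    assert serial == parallel, "parallel metrics differ from serial metrics"
    return serial, parallel
```

In a real PySpark test the same comparison would presumably be made between the `avgMetrics` of models fit with `parallelism=1` and `parallelism=2` on the same dataset, with a tolerance if the estimator is not fully deterministic.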
---