Github user smurching commented on a diff in the pull request:
https://github.com/apache/spark/pull/19186#discussion_r138139091
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/param/shared/SharedParamsCodeGen.scala
---
@@ -82,7 +82,8 @@ private[shared] object SharedParamsCodeGen {
"all instance weights as 1.0"),
ParamDesc[String]("solver", "the solver algorithm for optimization",
finalFields = false),
ParamDesc[Int]("aggregationDepth", "suggested depth for
treeAggregate (>= 2)", Some("2"),
- isValid = "ParamValidators.gtEq(2)", isExpertParam = true))
+ isValid = "ParamValidators.gtEq(2)", isExpertParam = true),
+ ParamDesc[Boolean]("handlePersistence", "whether to handle data
persistence", Some("true")))
--- End diff --
This description could be a bit clearer, how about "if true, will cache
unpersisted input data before fitting estimator on it"?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]