[
https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14323242#comment-14323242
]
Joseph K. Bradley commented on SPARK-4766:
------------------------------------------
[~prudenko] That's a great point about avoiding re-computing the same RDD when
doing cross-validation. I think it should be done by specializing
Pipeline.fit(Array[ParamMap]). I've made a JIRA for that (linked above).
> ML Estimator Params should subclass Transformer Params
> ------------------------------------------------------
>
> Key: SPARK-4766
> URL: https://issues.apache.org/jira/browse/SPARK-4766
> Project: Spark
> Issue Type: Improvement
> Components: ML
> Affects Versions: 1.2.0
> Reporter: Joseph K. Bradley
>
> Currently, in spark.ml, both Transformers and Estimators extend the same
> Params classes. There should be one Params class for the Transformer and one
> for the Estimator, where the Estimator params class extends the Transformer
> one.
> E.g., it is weird to be able to do:
> {code}
> val model: LogisticRegressionModel = ...
> model.getMaxIter()
> {code}
> (This is the only case where this happens currently, but it is worth setting
> a precedent.)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]