[ https://issues.apache.org/jira/browse/SPARK-5956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Xiangrui Meng updated SPARK-5956:
---------------------------------
    Priority: Blocker  (was: Major)

> Transformer/Estimator should be copyable.
> -----------------------------------------
>
>                 Key: SPARK-5956
>                 URL: https://issues.apache.org/jira/browse/SPARK-5956
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML
>    Affects Versions: 1.3.0
>            Reporter: Xiangrui Meng
>            Assignee: Xiangrui Meng
>            Priority: Blocker
>
> In a pipeline, we don't save additional params specified in `fit()` to
> transformers, because we should not modify them. The current solution is to
> store training parameters in the pipeline model and apply those parameters at
> `transform()`. A better solution would be to make transformers copyable.
> Calling `.copy` on a transformer produces a new transformer with a different
> UID but the same parameters. Then we can use the copied transformers in the
> pipeline model, with the additional params stored in them.
> `copy` may not be a good name because it is not an exact copy.
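
The proposal above can be illustrated with a small, self-contained sketch. It is plain Python and does not use Spark's API: `ToyTransformer`, `ToyPipelineModel`, `fit_pipeline`, and the `scale` param are hypothetical names invented for illustration. It only shows the idea that fitting copies each stage (new UID, same params plus the extra ones passed to fit) and leaves the original stages unmodified.

```python
import uuid
from copy import copy as shallow_copy


class ToyTransformer:
    """Hypothetical pipeline stage (not Spark's Transformer)."""

    def __init__(self, **params):
        self.uid = uuid.uuid4().hex
        self.params = dict(params)

    def copy(self, extra=None):
        # Not an exact copy: the result gets a fresh UID, keeps the
        # existing params, and stores any extra params on top of them.
        other = shallow_copy(self)
        other.uid = uuid.uuid4().hex
        other.params = {**self.params, **(extra or {})}
        return other

    def transform(self, values):
        # Toy behavior: multiply every value by the "scale" param.
        scale = self.params.get("scale", 1.0)
        return [v * scale for v in values]


class ToyPipelineModel:
    """Holds copied stages, so no separate param map is needed at transform time."""

    def __init__(self, stages):
        self.stages = stages

    def transform(self, values):
        for stage in self.stages:
            values = stage.transform(values)
        return values


def fit_pipeline(stages, extra_params=None):
    # extra_params maps a stage UID to a dict of param overrides
    # (an assumed convention for this sketch).
    extra_params = extra_params or {}
    copies = [s.copy(extra_params.get(s.uid)) for s in stages]
    return ToyPipelineModel(copies)
```

For example, fitting with `{scaler.uid: {"scale": 3.0}}` yields a model whose copied stage scales by 3.0, while the original `scaler` still scales by whatever it was constructed with.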