GitHub user yinxusen opened a pull request:
https://github.com/apache/spark/pull/11266
[SPARK-13038][SPARK-13388][PySpark] Add load/save to pipeline and change it
to use the Scala implementation
## What changes were proposed in this pull request?
JIRA issues:
- https://issues.apache.org/jira/browse/SPARK-13038
- https://issues.apache.org/jira/browse/SPARK-13388
## How was the patch tested?
The patch is tested with Python doctest. I remove the original unit test
for Pipeline and PipelineModel since the `MockEstimator` and `MockTransformer`
are not suitable to use in Pipeline now because they have no Java Object.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/yinxusen/spark SPARK-13038
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/11266.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #11266
----
commit 9c3bf463d975b3175d4edaf5c09dd3691f2348b6
Author: Xusen Yin <[email protected]>
Date: 2016-02-16T22:31:15Z
split pipeline with base classes
commit f9198233169ee7f35af18161195914f0f7733c4f
Author: Xusen Yin <[email protected]>
Date: 2016-02-19T06:53:16Z
add save load for Pipeline and change it to depend on Spark side
commit b55bc606531ac55529c857314a5e8db9ab2fe24b
Author: Xusen Yin <[email protected]>
Date: 2016-02-19T07:52:49Z
add PipelineModel save load
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]