GitHub user ajaysaini725 opened a pull request:

    [Spark-17025][ML][Python] Persistence for Custom Python-only Pipelines

    ## What changes were proposed in this pull request?
    Implemented a Python-only persistence framework for pipelines containing
    stages that cannot be saved using Java.
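
As a rough illustration of what Python-only persistence can look like, the sketch below saves each stage's class name and parameters as JSON and rebuilds the stages from that metadata on load. It is not the code in this pull request and does not use PySpark: `SuffixStage`, `STAGE_REGISTRY`, `save_pipeline`, `load_pipeline`, and `pipelines_match` are hypothetical names chosen for the example, and the on-disk layout (one `<index>_stage.json` file per stage) is an assumption.

```python
import json
import os
import tempfile


class SuffixStage:
    """Hypothetical Python-only stage; not part of PySpark."""

    def __init__(self, suffix=""):
        self.suffix = suffix

    def params(self):
        # Everything needed to rebuild the stage, as JSON-serializable data.
        return {"suffix": self.suffix}

    def transform(self, values):
        return [value + self.suffix for value in values]


# Assumption: a simple name-to-class registry stands in for class lookup.
STAGE_REGISTRY = {"SuffixStage": SuffixStage}


def save_pipeline(stages, path):
    """Write one JSON metadata file per stage, prefixed by its position."""
    os.makedirs(path, exist_ok=True)
    for index, stage in enumerate(stages):
        metadata = {"class": type(stage).__name__, "params": stage.params()}
        with open(os.path.join(path, f"{index}_stage.json"), "w") as handle:
            json.dump(metadata, handle)


def load_pipeline(path):
    """Rebuild the stages in their original order from the metadata files."""
    names = sorted(os.listdir(path), key=lambda name: int(name.split("_")[0]))
    stages = []
    for name in names:
        with open(os.path.join(path, name)) as handle:
            metadata = json.load(handle)
        stage_class = STAGE_REGISTRY[metadata["class"]]
        stages.append(stage_class(**metadata["params"]))
    return stages


def pipelines_match(first, second):
    """Compare stage classes and parameters, position by position."""
    def describe(stages):
        return [(type(s).__name__, s.params()) for s in stages]
    return describe(first) == describe(second)


# Round trip: save two stages, load them back, and compare.
with tempfile.TemporaryDirectory() as directory:
    original = [SuffixStage("_a"), SuffixStage("_b")]
    save_pipeline(original, directory)
    restored = load_pipeline(directory)

print(pipelines_match(original, restored))
```

The registry keeps the sketch self-contained; a real framework would more likely resolve each class from a fully qualified name stored in the metadata.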
    ## How was this patch tested?
    Created a custom Python-only UnaryTransformer, included it in a Pipeline,
    and saved/loaded the pipeline. The loaded pipeline was compared against the
    original using _compare_pipelines() in

You can merge this pull request into a Git repository by running:

    $ git pull PythonPipelines

Alternatively you can review and apply these changes as the patch at:

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #18888

commit 840a19324f54a1077d59a7cd6f8e911f57505370
Author: Ajay Saini <>
Date:   2017-08-08T23:24:23Z

    Pipeline persistence commit with tests.

commit 85a98d6fa45e7f6bb32af9b949913e70b32f8ca9
Author: Ajay Saini <>
Date:   2017-08-08T23:30:33Z

    Fixed import


