GitHub user holdenk opened a pull request:
https://github.com/apache/spark/pull/19876
[WIP][ML][SPARK-11171][SPARK-11237] Add PMML export to Spark ML pipelines
## What changes were proposed in this pull request?
Adds PMML export support to Spark ML pipelines in the style of Spark's
DataSource API to allow library authors to add their own model export formats.
This is a WIP to see if this is the design we want to go with.
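
To make the idea of pluggable export formats concrete, here is a minimal sketch in plain Python of a writer that looks up its output format by name. It is only an illustration and is not the code in this patch: `ModelWriterFormat`, `register_format`, `ToyPmmlWriter`, `JsonWriter`, and `GeneralWriter` are all hypothetical names, the "model" is a plain dict, the output is not valid PMML, and `save()` returns a string instead of writing to a path.

```python
# Hypothetical sketch: none of these names come from the pull request.
import json
from abc import ABC, abstractmethod
from typing import Dict, Type


class ModelWriterFormat(ABC):
    """A pluggable export format, looked up by a short name."""

    @abstractmethod
    def write(self, model: dict) -> str:
        """Serialize the model and return the serialized text."""


# Registry mapping a lower-cased format name to its writer class.
_FORMATS: Dict[str, Type[ModelWriterFormat]] = {}


def register_format(name: str):
    """Class decorator that registers a writer under a short format name."""
    def decorator(cls):
        _FORMATS[name.lower()] = cls
        return cls
    return decorator


@register_format("json")
class JsonWriter(ModelWriterFormat):
    def write(self, model: dict) -> str:
        return json.dumps(model)


@register_format("pmml")
class ToyPmmlWriter(ModelWriterFormat):
    def write(self, model: dict) -> str:
        # Not real PMML: just enough XML to show the dispatch.
        coefs = ",".join(str(c) for c in model["coefficients"])
        return f'<PMML><RegressionModel coefficients="{coefs}"/></PMML>'


class GeneralWriter:
    """Writer whose output format is chosen by name at call time."""

    def __init__(self, model: dict):
        self._model = model
        self._format = "json"  # assumed default, for this sketch only

    def format(self, name: str) -> "GeneralWriter":
        self._format = name.lower()
        return self

    def save(self) -> str:
        # Look up the registered writer; fail loudly on unknown formats.
        if self._format not in _FORMATS:
            raise ValueError(f"No writer registered for format '{self._format}'")
        return _FORMATS[self._format]().write(self._model)
```

With this shape, a caller would write something like `GeneralWriter(model).format("pmml").save()`, and a library author could add another format by registering one more class, without touching the writer itself.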
## How was this patch tested?
Basic unit test.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/holdenk/spark SPARK-11171-SPARK-11237-Add-PMML-export-for-ML-KMeans-r2
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/19876.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #19876
----
commit 43ae30f08aed921178da07a5e982297b272c7c8f
Author: Holden Karau <[email protected]>
Date: 2017-11-24T14:00:16Z
Initial attempt at allowing Spark ML writers to be slightly more pluggable
commit 9fec08fbd2dd1c980d5862f0b4521213e1e9349c
Author: Holden Karau <[email protected]>
Date: 2017-11-25T12:55:19Z
The LinearRegression suite passes
commit 0075bf4776ecffa7fcb24a6f74c0e96161d6221c
Author: Holden Karau <[email protected]>
Date: 2017-11-25T13:00:18Z
Add missing META-INFO for MLFormatRegister
commit c68880d6d982c56934f4b583263ed5cd4e8329d6
Author: Holden Karau <[email protected]>
Date: 2017-11-25T16:19:35Z
Add a (untested) PMMLLinearRegressionModelWriter
commit c2108df2b499bd45dff0e8add789f01d8c3c2c48
Author: Holden Karau <[email protected]>
Date: 2017-12-04T10:00:56Z
Basic PMML export test
commit de8619098eeb01ff86b54753f27c29729935bb94
Author: Holden Karau <[email protected]>
Date: 2017-12-04T11:27:03Z
Add PMML testing utils for Spark ML that were accidentally left out
----