GitHub user BryanCutler opened a pull request:
https://github.com/apache/spark/pull/11621
[SPARK-13430][PySpark][ML] Python API for training summaries of linear and
logistic regression
## What changes were proposed in this pull request?
Adding Python API for training summaries of LogisticRegression and
LinearRegression in PySpark ML.
## How was this patch tested?
Added unit tests to exercise the api calls for the summary classes. Also,
manually verified values are expected and match those from Scala directly.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/BryanCutler/spark
pyspark-ml-summary-SPARK-13430
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/11621.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #11621
----
commit 18047484cf869ae5c6fce32c6b64b9069d709eae
Author: Bryan Cutler <[email protected]>
Date: 2016-03-02T01:33:20Z
[SPARK-13430] Added summary classes for logistic and linear regression
commit 57f15cd675cd50a82ef479286d1c027b0c7f700b
Author: Bryan Cutler <[email protected]>
Date: 2016-03-02T22:23:54Z
adding test for ml linear regression training summary
commit 4d4bf1a8766834bb49b7014057bac5c0a7f8a03a
Author: Bryan Cutler <[email protected]>
Date: 2016-03-03T01:32:09Z
completed test for ml linear regression training summary
commit f9da8e6df323f5c6447d6f9cae771b910023b3ef
Author: Bryan Cutler <[email protected]>
Date: 2016-03-03T21:53:49Z
adding test for ml logistic regression training summary
commit ce69f9d5d5748f95c63883c5920e59bbae4e3b79
Author: Bryan Cutler <[email protected]>
Date: 2016-03-03T22:53:02Z
changed residual to only check that DataFrame is returned
commit 5d9bf20341896607b6ceb66e2900883110ec9578
Author: Bryan Cutler <[email protected]>
Date: 2016-03-09T22:58:14Z
Merge remote-tracking branch 'upstream/master' into
pyspark-ml-summary-SPARK-13430
Conflicts:
python/pyspark/ml/classification.py
python/pyspark/ml/tests.py
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]