GitHub user evanyc15 opened a pull request:
https://github.com/apache/spark/pull/10270
[SPARK-10931][PYSPARK][ML] PySpark ML Models should contain Param values
PySpark spark.ml Models are generally wrappers around Java objects and do
not even contain Param values. This JIRA is for copying the Param values from
the Estimator to the model.
This can likely be solved by modifying Estimator.fit to copy Param values,
but should also include proper unit tests.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/evanyc15/spark SPARK-10931-pyspark-mllib
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/10270.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #10270
----
commit 53062d1edc08bf89b7cdb46969c182aa0f26dbe4
Author: Evan Chen <[email protected]>
Date: 2015-11-19T03:54:57Z
Copied parameters over from Estimator to Transformer
commit f0b124a1f67037f854d1e7891091ba4d1cdcecc8
Author: Evan Chen <[email protected]>
Date: 2015-11-24T00:44:53Z
Estimator UID is being copied correctly to the Transformer model objects
and params now, working on Doctests
commit 1c5a791775f7f078b3a488c5ea88beed29c2a8d7
Author: Evan Chen <[email protected]>
Date: 2015-11-25T00:16:32Z
Changed the way parameters are copied from the Estimator to Transformer
commit 332cc670b61c5bd19cb5cea705a307440fc92868
Author: Evan Chen <[email protected]>
Date: 2015-12-01T22:51:24Z
Checkpoint, switching back to inheritance method
commit 07fbbfd91692ecb61b0e8659ee296dfaf3150f13
Author: Evan Chen <[email protected]>
Date: 2015-12-02T00:54:41Z
Working on DocTests
commit d86e1dfb33aadfae3a151edf0ceaa6593cfa074e
Author: Evan Chen <[email protected]>
Date: 2015-12-03T02:07:05Z
Implemented Doctests for Recommendation, Clustering, Classification (except
RandomForestClassifier), Evaluation, Tuning, Regression (except
RandomRegression)
commit a5902cfc6622eb4c6c5d83a489f6693b08f04518
Author: Evan Chen <[email protected]>
Date: 2015-12-04T23:20:42Z
Ready for Code Review
commit 24dd45a30b75c9b7e33edf37993b2277f5cbe606
Author: Evan Chen <[email protected]>
Date: 2015-12-11T01:35:40Z
Code Review changeset #1
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]