GitHub user holdenk opened a pull request:
https://github.com/apache/spark/pull/10298
[SPARK-12296][PYSPARK][MLLIB] Feature parity for pyspark mllib standard
scaler model
Some methods are missing, such as ways to access the std, mean, etc. This
PR is for feature parity for pyspark.mllib.feature.StandardScaler &
StandardScalerModel.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/holdenk/spark
SPARK-12296-feature-parity-pyspark-mllib-StandardScalerModel
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/10298.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #10298
----
commit 712ac0a59af30e09fb43f914aa35225249f770c9
Author: Holden Karau <[email protected]>
Date: 2015-12-14T20:28:03Z
Add properties for getting the std and mean on the StandardScalerModel
commit de3591f810fd140487d535375f26b1e9335caf2a
Author: Holden Karau <[email protected]>
Date: 2015-12-14T20:32:31Z
add withStd and withMean so people can check what kind of scaling the model
was trained for
commit 8450e4628ce8bbd23f2cb6a72c2f62613b896c5e
Author: Holden Karau <[email protected]>
Date: 2015-12-14T20:59:20Z
Is targetted for 2.0.0
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]