GitHub user holdenk opened a pull request:
https://github.com/apache/spark/pull/6139
[Spark-7511] pyspark ml seed param should be random by default or 42 is
quite funny but not very random
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/holdenk/spark SPARK-7511-pyspark-ml-seed-param-should-be-random-by-default-or-42-is-quite-funny-but-not-very-random
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/6139.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #6139
----
commit 1ff17c240ef98a939827699522b2250a019c8676
Author: Holden Karau <[email protected]>
Date: 2015-05-14T01:56:57Z
Make the seed random for HasSeed in python
commit 213a54306d653994491ba841e735d595ac3c6027
Author: Holden Karau <[email protected]>
Date: 2015-05-14T02:00:40Z
Simplify the generated code to only include set default if there is a
default rather than having None is not None in the generated code
commit 0e3797e48c92791f529f1879336a795acbac3502
Author: Holden Karau <[email protected]>
Date: 2015-05-14T02:50:55Z
Make seed default to random in more places
----
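The commits above describe two ideas: drawing a random seed when the caller does not supply one, and emitting a set-default line in generated code only when a default exists. The following is a minimal sketch of both, not the code in this pull request. The names HasSeedSketch and gen_default_line are hypothetical, and the 32-bit seed width and the exact form of the generated line are assumptions made for illustration.

```python
import random


class HasSeedSketch:
    """Hypothetical stand-in for a seed mixin; not the PySpark class."""

    def __init__(self, seed=None):
        # Draw a fresh seed when the caller does not supply one,
        # instead of falling back to a fixed constant such as 42.
        # The 32-bit width is an assumption for this sketch.
        self.seed = random.getrandbits(32) if seed is None else seed

    def getSeed(self):
        return self.seed


def gen_default_line(name, default_value_str):
    """Return a generated set-default line, or "" when there is no default.

    The decision is made at generation time, so the generated code never
    needs to contain a check of the default against None.
    """
    if default_value_str is None:
        return ""
    return "self._setDefault(%s=%s)" % (name, default_value_str)
```

Passing an explicit seed, as in HasSeedSketch(seed=7), keeps a run reproducible; omitting it gives a different seed on each construction.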