GitHub user mmenestret opened a pull request:
https://github.com/apache/spark/pull/7997
SPARK-9690 Adding the possibility to set the seed of the rand in the â¦
â¦CrossValidator fold
The fold in the ML CrossValidator depends on a rand whose seed is set to 0
and it leads the sql.functions rand to call sc._jvm.functions.rand() with no
seed.
In order to be able to unit test a Cross Validation it would be a good idea
to be able to set this seed so the output of the cross validation (with a
featureSubsetStrategy set to "all") would always be the same.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/mmenestret/spark
SPARK-9690-CrossValidator-Random-Seed
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/7997.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #7997
----
commit 480a5d081fe34bea42dc3c474a57a811947f483f
Author: Martin MENESTRET <[email protected]>
Date: 2015-08-06T16:28:48Z
SPARK-9690 Adding the possibility to set the seed of the rand in the
CrossValidator fold
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]