[
https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joseph K. Bradley updated SPARK-7511:
-------------------------------------
Description:
Currently, Scala's HasSeed mix-in uses a random Long as the default value for
seed. Python uses 42. After discussions, we've decided to use a seed which
varies based on the class name, but which is fixed instead of random. This
will make behavior reproducible, rather than random, by default. Users will
still be able to change the random seed.
Scala's seed will be fixed in a separate patch.
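
As a rough illustration of the behavior described above (a minimal sketch, not the actual patch): the default seed is derived from the class name, so it differs between classes but stays the same on every run, and an explicit seed still takes precedence. The helper `default_seed`, the toy `HasSeed` stand-in, and the classes `EstimatorA` and `EstimatorB` are hypothetical names used only for this example. The use of `zlib.crc32` is also an assumption, chosen because Python 3's built-in `hash()` on strings is salted per process unless `PYTHONHASHSEED` is set.

```python
import zlib


def default_seed(cls):
    """Return a fixed default seed derived from the class name (hypothetical helper)."""
    # zlib.crc32 gives the same value in every process, unlike Python 3's
    # built-in hash() on strings, which is salted per interpreter run.
    return zlib.crc32(cls.__name__.encode("utf-8"))


class HasSeed(object):
    """Toy stand-in for the seed mix-in; not the real pyspark.ml class."""

    def __init__(self, seed=None):
        # Fall back to the per-class default only when no seed is given,
        # so users can still choose their own seed (including 0).
        self.seed = default_seed(type(self)) if seed is None else seed


class EstimatorA(HasSeed):
    pass


class EstimatorB(HasSeed):
    pass
```

With this sketch, two instances of `EstimatorA` share the same default seed, `EstimatorB` gets a different one, and `EstimatorA(seed=7)` uses 7.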
was: Currently, Scala's HasSeed mix-in uses a random Long as the default value
for seed. Python should too. (Currently, it seems to use "42")
> PySpark ML seed Param should be varied per class
> ------------------------------------------------
>
> Key: SPARK-7511
> URL: https://issues.apache.org/jira/browse/SPARK-7511
> Project: Spark
> Issue Type: Improvement
> Components: ML, PySpark
> Reporter: Joseph K. Bradley
> Priority: Minor