[ 
https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joseph K. Bradley updated SPARK-7511:
-------------------------------------
    Description: 
Currently, Scala's HasSeed mix-in uses a random Long as the default value for 
seed.  Python uses 42.  After discussions, we've decided to use a seed which 
varies based on the class name, but which is fixed instead of random.  This 
will make behavior reproducible, rather than random, by default.  Users will 
still be able to change the random seed.

Scala's seed will be fixed in a separate patch

  was:Currently, Scala's HasSeed mix-in uses a random Long as the default value 
for seed.  Python should too.  (Currently, it seems to use "42")


> PySpark ML seed Param should be varied per class
> ------------------------------------------------
>
>                 Key: SPARK-7511
>                 URL: https://issues.apache.org/jira/browse/SPARK-7511
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML, PySpark
>            Reporter: Joseph K. Bradley
>            Priority: Minor
>
> Currently, Scala's HasSeed mix-in uses a random Long as the default value for 
> seed.  Python uses 42.  After discussions, we've decided to use a seed which 
> varies based on the class name, but which is fixed instead of random.  This 
> will make behavior reproducible, rather than random, by default.  Users will 
> still be able to change the random seed.
> Scala's seed will be fixed in a separate patch



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to