RJ Nowling created SPARK-6522:
---------------------------------

             Summary: Standardize Random Number Generation
                 Key: SPARK-6522
                 URL: https://issues.apache.org/jira/browse/SPARK-6522
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
    Affects Versions: 1.3.0
            Reporter: RJ Nowling
            Priority: Minor


Generation of random numbers in Spark has to be handled carefully since 
references to RNGs copy the state to the workers.  As such, a separate RNG 
needs to be seeded for each partition.  Each time random numbers are used in 
Spark's libraries, the RNG seeding is re-implemented, leaving open the 
possibility of mistakes.

It would be useful if RNG seeding was standardized through utility functions or 
random number generation functions that can be called in Spark pipelines.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to