GitHub user jkbradley opened a pull request:

    https://github.com/apache/spark/pull/3981

    [SPARK-5015] [mllib] Random seed for GMM + make test suite deterministic

    Issues:
    * From JIRA: GaussianMixtureEM uses randomness but does not take a random 
seed. It should take one as a parameter.
    * This also makes the test suite flaky since initialization can fail due to 
stochasticity.
    
    Fix:
    * Add random seed
    * Use it in test suite
    
    CC: @mengxr  @tgaloppo 

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jkbradley/spark gmm-seed

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/3981.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #3981
    
----
commit f0df4fd786d7bc552e6ecd2447d8f814a98eb34b
Author: Joseph K. Bradley <[email protected]>
Date:   2015-01-09T18:39:50Z

    Added seed parameter to GMM.  Updated test suite to use seed to prevent 
flakiness

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to