GitHub user tgaloppo opened a pull request:

    https://github.com/apache/spark/pull/3854

    SPARK-5020 [MLlib] GaussianMixtureModel.predictMembership() should take an 
RDD only

    Removed unnecessary parameters to predictMembership()
    
    CC: @jkbradley


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/tgaloppo/spark spark-5020

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/3854.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #3854
    
----
commit c15405c78345e9a46549a398c6b59bed80274f9e
Author: Travis Galoppo <[email protected]>
Date:   2014-10-30T18:50:47Z

    SPARK-4156

commit 5c96c576c5011f37fa7b84e88b7f1f8f48b82310
Author: Travis Galoppo <[email protected]>
Date:   2014-11-11T23:30:42Z

    Merge remote-tracking branch 'upstream/master'

commit c1a8e16ac9dc301d9eafca44100c5a857de903d5
Author: Travis Galoppo <[email protected]>
Date:   2014-11-11T23:31:44Z

    Made GaussianMixtureModel class serializable
    Modified sum function for better performance

commit 719d8cc119e4ce117d33732c72c6c42da7456ae4
Author: Travis Galoppo <[email protected]>
Date:   2014-11-13T01:23:53Z

    Added scala test suite with basic test

commit 86fb3829388484e51de46ffe96d237b63b752aeb
Author: Travis Galoppo <[email protected]>
Date:   2014-11-17T22:46:08Z

    Merge remote-tracking branch 'upstream/master'

commit e6ea8051a5e883121d650a5a8d7e79600cc642ed
Author: Travis Galoppo <[email protected]>
Date:   2014-11-18T00:52:12Z

    Merged with master branch; update test suite with latest context changes.
    Improved cluster initialization strategy.

commit 676e523a830020adf317d3c5bf445f94ed5c8a35
Author: Travis Galoppo <[email protected]>
Date:   2014-12-03T15:14:19Z

    Fixed to no longer ignore delta value provided on command line

commit 8aaa17d14a39d32a9164d044318d54f30df86202
Author: Travis Galoppo <[email protected]>
Date:   2014-12-03T15:15:01Z

    Added additional train() method to companion object for cluster count and 
tolerance parameters.
    
    Modified cluster initialization strategy to use an initial covariance 
matrix derived from the sample points used to initialize the mean.

commit 9770261fc67a56a3ef9178f77e42d4e0deecd433
Author: Travis Galoppo <[email protected]>
Date:   2014-12-12T17:04:15Z

    Corrected a variety of style and naming issues.

commit e7d413b8e7e8614097b0c3823652677f38062775
Author: Travis Galoppo <[email protected]>
Date:   2014-12-12T19:10:47Z

    Moved multivariate Gaussian utility class to mllib/stat/impl
    Improved comments

commit dc9c74264631a80dbb718a7be4c184ccae6277c0
Author: Travis Galoppo <[email protected]>
Date:   2014-12-12T19:12:36Z

    Moved MultivariateGaussian utility class

commit 97044cff25d7e7450296706669b1c1ba6cf692f0
Author: Travis Galoppo <[email protected]>
Date:   2014-12-16T01:09:46Z

    Fixed style issues

commit f407b4c22257a03409f4ab4dd15ed92738f58140
Author: FlytxtRnD <[email protected]>
Date:   2014-12-16T11:47:40Z

    Added predict() to return the cluster labels and membership values

commit b99ecc4d35bc548486d728741fc17658f07d83a5
Author: tgaloppo <[email protected]>
Date:   2014-12-16T12:32:51Z

    Merge pull request #1 from FlytxtRnD/predictBranch
    
    Added predict() to return the cluster labels and membership values

commit 2df336b6609ed4edc201ea6c26aa7b3faba4f284
Author: Travis Galoppo <[email protected]>
Date:   2014-12-16T12:37:48Z

    Fixed style issue

commit c3b8ce0fa808dbc95b6f19aee574c3f66863635e
Author: Travis Galoppo <[email protected]>
Date:   2014-12-16T12:38:02Z

    Merge branch 'master' of https://github.com/tgaloppo/spark
      Adds predict() method

commit d695034fab97c617b4c590896898235a8218dc6b
Author: Travis Galoppo <[email protected]>
Date:   2014-12-16T13:18:58Z

    Fixed style issues

commit 9be2534d67b0ac0f43ca083c0940afa81474e458
Author: Travis Galoppo <[email protected]>
Date:   2014-12-16T13:32:45Z

    Style issue

commit 8b633f33a1a9b0ebc0f0af84bc2164cbd199bcbc
Author: Travis Galoppo <[email protected]>
Date:   2014-12-16T13:34:30Z

    Style issue

commit 42b2142ed2a7440869d75b3fc926a74d9d19e30b
Author: Travis Galoppo <[email protected]>
Date:   2014-12-17T16:15:17Z

    Added functionality to allow setting of GMM starting point.
    Added two cluster test to testing suite.

commit 20ebca18ee09b31d201f21658c7b464fd0f5fad7
Author: Travis Galoppo <[email protected]>
Date:   2014-12-17T16:20:10Z

    Removed unusued code

commit cff73e0de3c7dd68fe7a23db385459678a646099
Author: Travis Galoppo <[email protected]>
Date:   2014-12-17T20:22:49Z

    Replaced accumulators with RDD.aggregate

commit 308c8ad9d88d275f11be0d3733f016d3119f98ee
Author: Travis Galoppo <[email protected]>
Date:   2014-12-18T06:11:47Z

    Numerous changes to improve code

commit 227ad66302f0d1166b452dc70b7cd3784a6f81e3
Author: Travis Galoppo <[email protected]>
Date:   2014-12-18T13:18:18Z

    Moved prediction methods into model class.

commit 578c2d101f6e87d19850e75c066a685124a8b3ed
Author: Travis Galoppo <[email protected]>
Date:   2014-12-18T13:32:22Z

    Removed unused import

commit 1de73f399a28060890c8617f1abddad46b30c132
Author: Travis Galoppo <[email protected]>
Date:   2014-12-18T21:45:28Z

    Removed redundant array from array creation

commit b97fe00d17c9d6665bb63f1f0bcbd968a875dfe8
Author: Travis Galoppo <[email protected]>
Date:   2014-12-19T01:28:02Z

    Minor fixes and tweaks.

commit 9b2fc2a96f6c5f44dbb6601c5c2f89b78352bed2
Author: Travis Galoppo <[email protected]>
Date:   2014-12-20T00:34:15Z

    Style improvements
    Changed ExpectationSum to a private class

commit acf1fba6b0084511272cf4e19e0cab4587a11d16
Author: Travis Galoppo <[email protected]>
Date:   2014-12-22T14:26:28Z

    Fixed parameter comment in GaussianMixtureModel
    Made maximum iterations an optional parameter to DenseGmmEM

commit 709e4bf854984b2eb3e5ed36c8a15590edf7e891
Author: Travis Galoppo <[email protected]>
Date:   2014-12-22T18:32:35Z

    fixed usage line to include optional maxIterations parameter

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to