GitHub user tgaloppo opened a pull request:
https://github.com/apache/spark/pull/3854
SPARK-5020 [MLlib] GaussianMixtureModel.predictMembership() should take an
RDD only
Removed unnecessary parameters from predictMembership()
CC: @jkbradley
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/tgaloppo/spark spark-5020
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/3854.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #3854
----
commit c15405c78345e9a46549a398c6b59bed80274f9e
Author: Travis Galoppo <[email protected]>
Date: 2014-10-30T18:50:47Z
SPARK-4156
commit 5c96c576c5011f37fa7b84e88b7f1f8f48b82310
Author: Travis Galoppo <[email protected]>
Date: 2014-11-11T23:30:42Z
Merge remote-tracking branch 'upstream/master'
commit c1a8e16ac9dc301d9eafca44100c5a857de903d5
Author: Travis Galoppo <[email protected]>
Date: 2014-11-11T23:31:44Z
Made GaussianMixtureModel class serializable
Modified sum function for better performance
commit 719d8cc119e4ce117d33732c72c6c42da7456ae4
Author: Travis Galoppo <[email protected]>
Date: 2014-11-13T01:23:53Z
Added scala test suite with basic test
commit 86fb3829388484e51de46ffe96d237b63b752aeb
Author: Travis Galoppo <[email protected]>
Date: 2014-11-17T22:46:08Z
Merge remote-tracking branch 'upstream/master'
commit e6ea8051a5e883121d650a5a8d7e79600cc642ed
Author: Travis Galoppo <[email protected]>
Date: 2014-11-18T00:52:12Z
Merged with master branch; update test suite with latest context changes.
Improved cluster initialization strategy.
commit 676e523a830020adf317d3c5bf445f94ed5c8a35
Author: Travis Galoppo <[email protected]>
Date: 2014-12-03T15:14:19Z
Fixed to no longer ignore delta value provided on command line
commit 8aaa17d14a39d32a9164d044318d54f30df86202
Author: Travis Galoppo <[email protected]>
Date: 2014-12-03T15:15:01Z
Added additional train() method to companion object for cluster count and
tolerance parameters.
Modified cluster initialization strategy to use an initial covariance
matrix derived from the sample points used to initialize the mean.
commit 9770261fc67a56a3ef9178f77e42d4e0deecd433
Author: Travis Galoppo <[email protected]>
Date: 2014-12-12T17:04:15Z
Corrected a variety of style and naming issues.
commit e7d413b8e7e8614097b0c3823652677f38062775
Author: Travis Galoppo <[email protected]>
Date: 2014-12-12T19:10:47Z
Moved multivariate Gaussian utility class to mllib/stat/impl
Improved comments
commit dc9c74264631a80dbb718a7be4c184ccae6277c0
Author: Travis Galoppo <[email protected]>
Date: 2014-12-12T19:12:36Z
Moved MultivariateGaussian utility class
commit 97044cff25d7e7450296706669b1c1ba6cf692f0
Author: Travis Galoppo <[email protected]>
Date: 2014-12-16T01:09:46Z
Fixed style issues
commit f407b4c22257a03409f4ab4dd15ed92738f58140
Author: FlytxtRnD <[email protected]>
Date: 2014-12-16T11:47:40Z
Added predict() to return the cluster labels and membership values
commit b99ecc4d35bc548486d728741fc17658f07d83a5
Author: tgaloppo <[email protected]>
Date: 2014-12-16T12:32:51Z
Merge pull request #1 from FlytxtRnD/predictBranch
Added predict() to return the cluster labels and membership values
commit 2df336b6609ed4edc201ea6c26aa7b3faba4f284
Author: Travis Galoppo <[email protected]>
Date: 2014-12-16T12:37:48Z
Fixed style issue
commit c3b8ce0fa808dbc95b6f19aee574c3f66863635e
Author: Travis Galoppo <[email protected]>
Date: 2014-12-16T12:38:02Z
Merge branch 'master' of https://github.com/tgaloppo/spark
Adds predict() method
commit d695034fab97c617b4c590896898235a8218dc6b
Author: Travis Galoppo <[email protected]>
Date: 2014-12-16T13:18:58Z
Fixed style issues
commit 9be2534d67b0ac0f43ca083c0940afa81474e458
Author: Travis Galoppo <[email protected]>
Date: 2014-12-16T13:32:45Z
Style issue
commit 8b633f33a1a9b0ebc0f0af84bc2164cbd199bcbc
Author: Travis Galoppo <[email protected]>
Date: 2014-12-16T13:34:30Z
Style issue
commit 42b2142ed2a7440869d75b3fc926a74d9d19e30b
Author: Travis Galoppo <[email protected]>
Date: 2014-12-17T16:15:17Z
Added functionality to allow setting of GMM starting point.
Added two cluster test to testing suite.
commit 20ebca18ee09b31d201f21658c7b464fd0f5fad7
Author: Travis Galoppo <[email protected]>
Date: 2014-12-17T16:20:10Z
Removed unused code
commit cff73e0de3c7dd68fe7a23db385459678a646099
Author: Travis Galoppo <[email protected]>
Date: 2014-12-17T20:22:49Z
Replaced accumulators with RDD.aggregate
commit 308c8ad9d88d275f11be0d3733f016d3119f98ee
Author: Travis Galoppo <[email protected]>
Date: 2014-12-18T06:11:47Z
Numerous changes to improve code
commit 227ad66302f0d1166b452dc70b7cd3784a6f81e3
Author: Travis Galoppo <[email protected]>
Date: 2014-12-18T13:18:18Z
Moved prediction methods into model class.
commit 578c2d101f6e87d19850e75c066a685124a8b3ed
Author: Travis Galoppo <[email protected]>
Date: 2014-12-18T13:32:22Z
Removed unused import
commit 1de73f399a28060890c8617f1abddad46b30c132
Author: Travis Galoppo <[email protected]>
Date: 2014-12-18T21:45:28Z
Removed redundant array from array creation
commit b97fe00d17c9d6665bb63f1f0bcbd968a875dfe8
Author: Travis Galoppo <[email protected]>
Date: 2014-12-19T01:28:02Z
Minor fixes and tweaks.
commit 9b2fc2a96f6c5f44dbb6601c5c2f89b78352bed2
Author: Travis Galoppo <[email protected]>
Date: 2014-12-20T00:34:15Z
Style improvements
Changed ExpectationSum to a private class
commit acf1fba6b0084511272cf4e19e0cab4587a11d16
Author: Travis Galoppo <[email protected]>
Date: 2014-12-22T14:26:28Z
Fixed parameter comment in GaussianMixtureModel
Made maximum iterations an optional parameter to DenseGmmEM
commit 709e4bf854984b2eb3e5ed36c8a15590edf7e891
Author: Travis Galoppo <[email protected]>
Date: 2014-12-22T18:32:35Z
fixed usage line to include optional maxIterations parameter
----