GitHub user jkbradley opened a pull request:
https://github.com/apache/spark/pull/12723
[SPARK-11940][PYSPARK][ML] Python API for ml.clustering.LDA PR2
## What changes were proposed in this pull request?
pyspark.ml API for LDA
* LDA, LDAModel, LocalLDAModel, DistributedLDAModel
* includes persistence
This replaces https://github.com/apache/spark/pull/10242
## How was this patch tested?
* doc test for LDA, including Param setters
* unit test for persistence
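For readers unfamiliar with the API, the following is a minimal sketch of how the classes listed above might be used. It is not code from this pull request. It assumes Spark 2.0 or later (where `pyspark.ml.linalg` is available) and a caller-supplied `SparkSession`. The names `SAMPLE_DOCS`, `VOCAB_SIZE`, and `run_lda_demo` are hypothetical and exist only for illustration.

```python
import os

# Illustrative sketch only; not code from the pull request.
# Toy corpus: (document id, {term index: term count}) over a 2-term vocabulary.
SAMPLE_DOCS = [
    (0, {0: 1.0}),
    (1, {1: 1.0}),
]
VOCAB_SIZE = 2


def run_lda_demo(spark, save_dir):
    """Fit LDA on the toy corpus, then save and reload the estimator and models.

    `spark` is an existing SparkSession; `save_dir` is a directory in which
    the sub-paths used below do not exist yet (save() refuses to overwrite).
    """
    # Imported lazily so this module loads even without Spark installed.
    from pyspark.ml.clustering import LDA, DistributedLDAModel, LocalLDAModel
    from pyspark.ml.linalg import Vectors

    rows = [(doc_id, Vectors.sparse(VOCAB_SIZE, counts))
            for doc_id, counts in SAMPLE_DOCS]
    df = spark.createDataFrame(rows, ["id", "features"])

    # Params can be given to the constructor or set afterwards with setters.
    lda = LDA(k=2, seed=1, optimizer="em")
    lda.setMaxIter(10)

    # With the "em" optimizer, fit() returns a DistributedLDAModel.
    model = lda.fit(df)
    local_model = model.toLocal()  # convert to a LocalLDAModel

    # Persistence: save each object, then load it back through its own class.
    lda_path = os.path.join(save_dir, "lda")
    dist_path = os.path.join(save_dir, "lda_distributed_model")
    local_path = os.path.join(save_dir, "lda_local_model")
    lda.save(lda_path)
    model.save(dist_path)
    local_model.save(local_path)

    return {
        "estimator": LDA.load(lda_path),
        "distributed": DistributedLDAModel.load(dist_path),
        "local": LocalLDAModel.load(local_path),
    }
```

To try it, create a session with `SparkSession.builder.master("local[1]").getOrCreate()` and pass a fresh temporary directory as `save_dir`. Calling `isDistributed()` on the reloaded models is one way to confirm which model class was restored.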
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/jkbradley/spark zjffdu-SPARK-11940
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/12723.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #12723
----
commit c0367f46994bf36e7f7a6baf9c8a2cc11fabb3c0
Author: Jeff Zhang <[email protected]>
Date: 2015-12-04T14:50:08Z
[SPARK-11940][PYSPARK] Python API for ml.clustering.LDA
commit 417de17ff91e2c0930d9e6b45d03c5d6600274a8
Author: Jeff Zhang <[email protected]>
Date: 2016-04-20T03:51:59Z
address comments
commit 66f265f1f639868fa897ff92b6f8b438ce90e094
Author: Jeff Zhang <[email protected]>
Date: 2016-04-20T05:34:24Z
code style fix
commit 09d5ca74e7a3195c15a6b29dd27b1604cee209be
Author: Jeff Zhang <[email protected]>
Date: 2016-04-22T01:59:42Z
address comments
commit 4f9bdaa89832ca2514c54d6782f9ffce72222db2
Author: Joseph K. Bradley <[email protected]>
Date: 2016-04-26T23:52:24Z
added type converter and used set instead of paramMap directly
commit 0d1292459c9a27807c7d07d12af90ec3d32df831
Author: Joseph K. Bradley <[email protected]>
Date: 2016-04-27T01:16:40Z
remaining PR cleanups, plus fixing use of :py:attr:
commit 4f807e82cf4366ea83557aa6060b14245c7f5764
Author: Joseph K. Bradley <[email protected]>
Date: 2016-04-27T02:35:00Z
Added persistence to LDA in Python
----