GitHub user feynmanliang opened a pull request:
https://github.com/apache/spark/pull/7705
[Spark-6793][MLlib] OnlineLDAOptimizer LDA perplexity
Implements `logPerplexity` in `OnlineLDAOptimizer`. Also refactors
inference code into companion object to enable future reuse (e.g. `predict`
method).
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/feynmanliang/spark SPARK-6793-perplexity
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/7705.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #7705
----
commit aead650cc2f36c8781b70ef6b377395440688837
Author: Feynman Liang <[email protected]>
Date: 2015-07-16T23:50:03Z
Range slice, in-place update, reduce transposes
commit c62cb1e46d3950226251bad3cfb3cb18ab012e50
Author: Feynman Liang <[email protected]>
Date: 2015-07-17T00:28:58Z
Outer product for stats, revert Range slicing
commit 7f62a55a43544b6eca31afaf0fe8f4d0abbaa3b8
Author: Feynman Liang <[email protected]>
Date: 2015-07-17T08:16:50Z
--amend
commit cd521d61ce40d174971590973b536c9edd72f469
Author: Feynman Liang <[email protected]>
Date: 2015-07-18T20:25:27Z
Refactor methods into companion class
commit f8adc483647ab1dd45cbe44b6a3015fc58e6ece4
Author: Feynman Liang <[email protected]>
Date: 2015-07-18T20:34:43Z
Add logPerplexity, refactor variationalBound into a method
commit 11e7c336ac4f1ecbd2b7eac6b5231e55fc705067
Author: Feynman Liang <[email protected]>
Date: 2015-07-27T23:20:16Z
Merge remote-tracking branch 'apache/master' into SPARK-6793-perplexity
* apache/master: (218 commits)
[SPARK-9385] [PYSPARK] Enable PEP8 but disable installing pylint.
[SPARK-4352] [YARN] [WIP] Incorporate locality preferences in dynamic
allocation requests
[SPARK-9385] [HOT-FIX] [PYSPARK] Comment out Python style check
[SPARK-8988] [YARN] Make sure driver log links appear in secure clusteâ¦
[SPARK-9355][SQL] Remove InternalRow.get generic getter call in columnar
cache code
[SPARK-9378] [SQL] Fixes test case "CTAS with serde"
[SPARK-9349] [SQL] UDAF cleanup
Closes #7690 since it has been merged into branch-1.4.
[HOTFIX] Disable pylint since it is failing master.
[SPARK-9369][SQL] Support IntervalType in UnsafeRow
[SPARK-9351] [SQL] remove literals from grouping expressions in Aggregate
[SPARK-7423] [MLLIB] Modify ClassificationModel and Probabalistic model
to use Vector.argmax
[SPARK-9376] [SQL] use a seed in RandomDataGeneratorSuite
[SPARK-9366] use task's stageAttemptId in TaskEnd event
[SPARK-9364] Fix array out of bounds and use-after-free bugs in
UnsafeExternalSorter
Pregel example type fix
[SPARK-4176] [SQL] Supports decimal types with precision > 18 in Parquet
[SPARK-8405] [DOC] Add how to view logs on Web UI when yarn log
aggregation is enabled
[SPARK-7943] [SPARK-8105] [SPARK-8435] [SPARK-8714] [SPARK-8561] Fixes
multi-database support
[SPARK-9371][SQL] fix the support for special chars in column names for
hive context
...
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]