GitHub user feynmanliang opened a pull request:

    https://github.com/apache/spark/pull/7705

    [Spark-6793][MLlib] OnlineLDAOptimizer LDA perplexity

    Implements `logPerplexity` in `OnlineLDAOptimizer`. Also refactors 
inference code into companion object to enable future reuse (e.g. `predict` 
method).

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/feynmanliang/spark SPARK-6793-perplexity

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/7705.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #7705
    
----
commit aead650cc2f36c8781b70ef6b377395440688837
Author: Feynman Liang <[email protected]>
Date:   2015-07-16T23:50:03Z

    Range slice, in-place update, reduce transposes

commit c62cb1e46d3950226251bad3cfb3cb18ab012e50
Author: Feynman Liang <[email protected]>
Date:   2015-07-17T00:28:58Z

    Outer product for stats, revert Range slicing

commit 7f62a55a43544b6eca31afaf0fe8f4d0abbaa3b8
Author: Feynman Liang <[email protected]>
Date:   2015-07-17T08:16:50Z

    --amend

commit cd521d61ce40d174971590973b536c9edd72f469
Author: Feynman Liang <[email protected]>
Date:   2015-07-18T20:25:27Z

    Refactor methods into companion class

commit f8adc483647ab1dd45cbe44b6a3015fc58e6ece4
Author: Feynman Liang <[email protected]>
Date:   2015-07-18T20:34:43Z

    Add logPerplexity, refactor variationalBound into a method

commit 11e7c336ac4f1ecbd2b7eac6b5231e55fc705067
Author: Feynman Liang <[email protected]>
Date:   2015-07-27T23:20:16Z

    Merge remote-tracking branch 'apache/master' into SPARK-6793-perplexity
    
    * apache/master: (218 commits)
      [SPARK-9385] [PYSPARK] Enable PEP8 but disable installing pylint.
      [SPARK-4352] [YARN] [WIP] Incorporate locality preferences in dynamic 
allocation requests
      [SPARK-9385] [HOT-FIX] [PYSPARK] Comment out Python style check
      [SPARK-8988] [YARN] Make sure driver log links appear in secure cluste…
      [SPARK-9355][SQL] Remove InternalRow.get generic getter call in columnar 
cache code
      [SPARK-9378] [SQL] Fixes test case "CTAS with serde"
      [SPARK-9349] [SQL] UDAF cleanup
      Closes #7690 since it has been merged into branch-1.4.
      [HOTFIX] Disable pylint since it is failing master.
      [SPARK-9369][SQL] Support IntervalType in UnsafeRow
      [SPARK-9351] [SQL] remove literals from grouping expressions in Aggregate
      [SPARK-7423] [MLLIB] Modify ClassificationModel and Probabalistic model 
to use Vector.argmax
      [SPARK-9376] [SQL] use a seed in RandomDataGeneratorSuite
      [SPARK-9366] use task's stageAttemptId in TaskEnd event
      [SPARK-9364] Fix array out of bounds and use-after-free bugs in 
UnsafeExternalSorter
      Pregel example type fix
      [SPARK-4176] [SQL] Supports decimal types with precision > 18 in Parquet
      [SPARK-8405] [DOC] Add how to view logs on Web UI when yarn log 
aggregation is enabled
      [SPARK-7943] [SPARK-8105] [SPARK-8435] [SPARK-8714] [SPARK-8561] Fixes 
multi-database support
      [SPARK-9371][SQL] fix the support for special chars in column names for 
hive context
      ...

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to