[ 
https://issues.apache.org/jira/browse/SPARK-5567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14578595#comment-14578595
 ] 

yuhao yang commented on SPARK-5567:
-----------------------------------

Hi Joseph, just to be clear: if we use the MAP prediction you mentioned, 
does it require "fold-in" Gibbs sampling (run until convergence) in the 
prediction process, or just a straightforward summation?

I looked at the implementation in 
https://github.com/mimno/Mallet/blob/master/src/cc/mallet/topics/TopicInferencer.java#L81
Is that in line with what you have in mind?
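
To make the question concrete, here is a rough sketch of the two options as I 
understand them. It is plain Python, not Spark or Mallet code, and every name 
in it (summation_estimate, fold_in_gibbs, phi, alpha) is hypothetical. It 
assumes the trained topic-word probabilities phi are held fixed and that the 
new document is a list of word ids. The "summation" variant is only my guess 
at what a one-pass estimate could look like.

```python
import random


def summation_estimate(doc, phi, alpha):
    """One pass, no sampling: sum per-token topic responsibilities.

    Assumption: each token's responsibility is phi[t][w] normalized over
    topics (i.e. a uniform topic prior), smoothed by alpha.
    """
    k = len(phi)
    counts = [alpha] * k
    for w in doc:
        col = [phi[t][w] for t in range(k)]
        norm = sum(col)
        for t in range(k):
            counts[t] += col[t] / norm
    total = sum(counts)
    return [c / total for c in counts]


def fold_in_gibbs(doc, phi, alpha, n_iter=200, burn_in=50, seed=0):
    """Fold-in Gibbs sampling: resample token topics with phi held fixed."""
    rng = random.Random(seed)
    k = len(phi)
    z = [rng.randrange(k) for _ in doc]   # random initial assignments
    n_dk = [0] * k                        # per-topic counts for this document
    for t in z:
        n_dk[t] += 1
    acc = [0.0] * k
    for it in range(n_iter):
        for i, w in enumerate(doc):
            n_dk[z[i]] -= 1               # remove the token's current topic
            weights = [(n_dk[t] + alpha) * phi[t][w] for t in range(k)]
            z[i] = rng.choices(range(k), weights=weights)[0]
            n_dk[z[i]] += 1
        if it >= burn_in:                 # average the post-burn-in sweeps
            for t in range(k):
                acc[t] += n_dk[t] + alpha
    total = sum(acc)
    return [a / total for a in acc]


# Toy model: 2 topics over a 4-word vocabulary (made-up numbers).
phi = [[0.45, 0.45, 0.05, 0.05],
       [0.05, 0.05, 0.45, 0.45]]
doc = [0, 1, 0, 1, 0, 2]
print(summation_estimate(doc, phi, alpha=0.1))
print(fold_in_gibbs(doc, phi, alpha=0.1))
```

The first function touches each token once, so its cost is fixed. The second 
needs a choice of iteration count and burn-in, which is the convergence 
concern I was asking about.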



> Add prediction methods to LDA
> -----------------------------
>
>                 Key: SPARK-5567
>                 URL: https://issues.apache.org/jira/browse/SPARK-5567
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>    Affects Versions: 1.3.0
>            Reporter: Joseph K. Bradley
>
> LDA currently supports prediction on the training set.  E.g., you can call 
> logLikelihood and topicDistributions to get that info for the training data.  
> However, it should support the same functionality for new (test) documents.
> This will require inference but should be able to use the same code, with a 
> few modifications to keep the inferred topics fixed.
> Note: The API for these methods is already in the code but is commented out.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
