Joseph K. Bradley created SPARK-6793:
----------------------------------------

             Summary: Implement prediction methods for LDA
                 Key: SPARK-6793
                 URL: https://issues.apache.org/jira/browse/SPARK-6793
             Project: Spark
          Issue Type: Improvement
          Components: MLlib
    Affects Versions: 1.3.0
            Reporter: Joseph K. Bradley


LDA does not currently support prediction on new datasets.  It should.  The 
prediction methods should include:
* Computing topic distributions for new documents
* Computing data metrics: log likelihood, perplexity

This task should probably be split up into sub-tasks for each prediction 
method, though we should think about whether code should be shared (and whether 
the return type should be able to produce all of these results since they 
require similar computation).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to