Joseph K. Bradley created SPARK-6793:
----------------------------------------
Summary: Implement prediction methods for LDA
Key: SPARK-6793
URL: https://issues.apache.org/jira/browse/SPARK-6793
Project: Spark
Issue Type: Improvement
Components: MLlib
Affects Versions: 1.3.0
Reporter: Joseph K. Bradley
LDA does not currently support prediction on new datasets. It should. The
prediction methods should include:
* Computing topic distributions for new documents
* Computing data metrics: log likelihood, perplexity
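
To make the two items above concrete, here is a minimal sketch in plain Python rather than against MLlib. It assumes the topics are already trained and held fixed, and it estimates the topic mixture for a new document with a simple EM "fold-in", which is a stand-in technique and not necessarily the inference MLlib would use. The function names and the word-id-to-count document format are hypothetical.

```python
import math


def infer_topic_distribution(counts, topics, iterations=50):
    """Estimate a new document's topic mixture with the topics held fixed.

    counts: dict mapping word id -> count in the document (assumed format).
    topics: list of K lists; topics[k][w] is P(word w | topic k).
    """
    k = len(topics)
    theta = [1.0 / k] * k  # start from a uniform mixture
    for _ in range(iterations):
        expected = [0.0] * k
        for w, n in counts.items():
            # E-step: share of word w attributed to each topic
            weights = [theta[j] * topics[j][w] for j in range(k)]
            norm = sum(weights)
            if norm == 0.0:
                continue  # word has zero probability under every topic
            for j in range(k):
                expected[j] += n * weights[j] / norm
        total = sum(expected)
        if total == 0.0:
            break  # nothing to update from; keep the current mixture
        # M-step: renormalise expected counts into a distribution
        theta = [v / total for v in expected]
    return theta


def log_likelihood(counts, topics, theta):
    """Sum over words of count * log P(word | theta, topics).

    Assumes every word in the document has non-zero probability under the
    mixture (for example, because the topics are smoothed).
    """
    ll = 0.0
    for w, n in counts.items():
        p = sum(theta[j] * topics[j][w] for j in range(len(topics)))
        ll += n * math.log(p)
    return ll


def perplexity(counts, topics, theta):
    """exp(-log likelihood / number of tokens); lower is better."""
    tokens = sum(counts.values())
    return math.exp(-log_likelihood(counts, topics, theta) / tokens)


# Toy usage: two topics over a three-word vocabulary.
topics = [[0.7, 0.2, 0.1], [0.1, 0.2, 0.7]]
doc = {0: 8, 1: 2}
theta = infer_topic_distribution(doc, topics)
print(theta, log_likelihood(doc, topics, theta), perplexity(doc, topics, theta))
```

Note that both metrics reuse the same per-word mixture probability, which is the shared computation mentioned below.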
This task should probably be split into sub-tasks, one per prediction
method. Before splitting, we should decide whether the methods should share
code, and whether a single return type should be able to produce all of these
results, since they require similar computation.
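
As one possible shape for a return type that can produce all of these results, here is a rough sketch in plain Python (not MLlib code). `DocumentScore`, `score_document`, and `corpus_perplexity` are hypothetical names, and the topic mixture `theta` is taken as given, however it was inferred. The idea is that a single pass stores the log likelihood and token count, and perplexity is derived from them at both the document and the corpus level.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Sequence, Tuple


@dataclass(frozen=True)
class DocumentScore:
    """Hypothetical result type holding everything one scoring pass yields."""
    topic_distribution: Tuple[float, ...]
    log_likelihood: float
    token_count: int

    @property
    def perplexity(self) -> float:
        # Derived on demand, so it never disagrees with log_likelihood.
        return math.exp(-self.log_likelihood / self.token_count)


def score_document(counts: Dict[int, int],
                   topics: Sequence[Sequence[float]],
                   theta: Sequence[float]) -> DocumentScore:
    """Score one document given fixed topics and its topic mixture.

    Assumes every word has non-zero probability under the mixture.
    """
    ll = 0.0
    for word, n in counts.items():
        p = sum(t * topic[word] for t, topic in zip(theta, topics))
        ll += n * math.log(p)
    return DocumentScore(tuple(theta), ll, sum(counts.values()))


def corpus_perplexity(scores: List[DocumentScore]) -> float:
    """Aggregate per-document results into a corpus-level perplexity."""
    total_ll = sum(s.log_likelihood for s in scores)
    total_tokens = sum(s.token_count for s in scores)
    return math.exp(-total_ll / total_tokens)
```

Returning the raw log likelihood and token count, rather than only the perplexity, keeps corpus-level aggregation a simple sum over documents.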