Joseph K. Bradley created SPARK-9246:
----------------------------------------
Summary: DistributedLDAModel predict top docs per topic
Key: SPARK-9246
URL: https://issues.apache.org/jira/browse/SPARK-9246
Project: Spark
Issue Type: New Feature
Components: MLlib
Reporter: Joseph K. Bradley
For each topic, return the top documents, ranked by their weight for that topic in topicDistributions.
Synopsis:
{code}
/**
 * @param maxDocuments Max docs to return for each topic
 * @return Array over topics of (sorted top docs, corresponding doc-topic weights)
 */
def topDocumentsPerTopic(maxDocuments: Int): Array[(Array[Long], Array[Double])]
{code}
Note: We will need to make sure that the above return value format is
Java-friendly.
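
To make the intended return shape concrete, here is a minimal local sketch of the ranking logic in plain Scala collections. It is not the proposed MLlib implementation: the object name TopDocsSketch and the explicit topicDistributions parameter are assumptions made for illustration, and a real DistributedLDAModel version would presumably compute this over a distributed dataset instead of an in-memory Seq. It also leaves the Java-friendliness question open.

```scala
object TopDocsSketch {
  /**
   * Hypothetical local stand-in for the proposed method (not the MLlib API).
   *
   * @param topicDistributions (document ID, per-topic weights) pairs; assumed
   *                           to all have the same number of topics
   * @param maxDocuments Max docs to return for each topic
   * @return Array over topics of (doc IDs sorted by descending weight,
   *         corresponding doc-topic weights)
   */
  def topDocumentsPerTopic(
      topicDistributions: Seq[(Long, Array[Double])],
      maxDocuments: Int): Array[(Array[Long], Array[Double])] = {
    // Number of topics, read from the first document (0 if there are none).
    val numTopics = topicDistributions.headOption.map(_._2.length).getOrElse(0)
    Array.tabulate(numTopics) { topic =>
      // Pair each document with its weight for this topic, then keep the
      // maxDocuments heaviest ones.
      val top = topicDistributions
        .map { case (docId, weights) => (docId, weights(topic)) }
        .sortBy { case (_, weight) => -weight }
        .take(maxDocuments)
      (top.map(_._1).toArray, top.map(_._2).toArray)
    }
  }
}
```

For example, with three documents whose weights are (0.9, 0.1), (0.2, 0.8) and (0.6, 0.4) for doc IDs 0, 1 and 2, calling the sketch with maxDocuments = 2 yields doc IDs 0 and 2 for the first topic and doc IDs 1 and 2 for the second. Sorting every document per topic is the simplest approach; a bounded priority queue per topic would avoid the full sort when maxDocuments is small.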