[ 
https://issues.apache.org/jira/browse/MAHOUT-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13942458#comment-13942458
 ] 

Suneel Marthi commented on MAHOUT-1470:
---------------------------------------

Mahout already has LDAPrintTopics, which prints the top K terms per topic. 
So this would basically transform the output of LdaTopics to replace topicId => 
topic and documentID => document?
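
To make that mapping concrete, here is a minimal sketch in plain Java of the
kind of report being discussed. It is not Mahout code and calls no Mahout APIs.
The class name TopicDumpSketch, the three input maps, and the rule of assigning
each document to its single highest-weight topic are all assumptions made for
illustration; a real utility would read these from the LDA output files.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch: none of these names exist in Mahout.
public class TopicDumpSketch {

  /** Index of the largest weight, i.e. the dominant topic of one document. */
  public static int dominantTopic(double[] weights) {
    int best = 0;
    for (int i = 1; i < weights.length; i++) {
      if (weights[i] > weights[best]) {
        best = i;
      }
    }
    return best;
  }

  /**
   * Builds a report with one line per topic: the topic label followed by the
   * names of the documents whose dominant topic it is.
   *
   * docTopics:   documentId -> per-topic weights (assumed input shape)
   * topicLabels: topicId -> readable label, e.g. its top terms
   * docNames:    documentId -> document name
   */
  public static String report(Map<Integer, double[]> docTopics,
                              Map<Integer, String> topicLabels,
                              Map<Integer, String> docNames) {
    // Sorted copies keep the output order deterministic (ascending ids).
    Map<Integer, double[]> sortedDocs = new TreeMap<>(docTopics);
    Map<Integer, List<String>> docsByTopic = new TreeMap<>();

    for (Map.Entry<Integer, double[]> e : sortedDocs.entrySet()) {
      int topic = dominantTopic(e.getValue());
      // Fall back to a generated name when the id has no mapping.
      String name = docNames.getOrDefault(e.getKey(), "doc-" + e.getKey());
      docsByTopic.computeIfAbsent(topic, k -> new ArrayList<>()).add(name);
    }

    StringBuilder out = new StringBuilder();
    for (Map.Entry<Integer, List<String>> e : docsByTopic.entrySet()) {
      String label = topicLabels.getOrDefault(e.getKey(), "topic-" + e.getKey());
      out.append(label)
         .append(": ")
         .append(String.join(", ", e.getValue()))
         .append('\n');
    }
    return out.toString();
  }
}
```

Calling TopicDumpSketch.report(docTopics, topicLabels, docNames) returns the
report as a single string. Assigning each document to only one topic is a
simplification; a threshold on the weights would let a document appear under
several topics.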

> Topic dump
> ----------
>
>                 Key: MAHOUT-1470
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1470
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>    Affects Versions: 1.0
>            Reporter: Andrew Musselman
>            Priority: Minor
>             Fix For: 0.9
>
>
> Per 
> http://mail-archives.apache.org/mod_mbox/mahout-user/201403.mbox/%3CCAMc_qaL2DCgbVbam2miNsLpa4qvaA9sMy1-arccF9Nz6ApcsvQ%40mail.gmail.com%3E
> > The script needs to be corrected to not call vectordump for LDA as
> > vectordump utility (or even clusterdump) are presently not capable of
> > displaying topics and relevant documents. I recall this issue was
> > previously reported by Peyman Faratin post 0.9 release.
> >
> > Mahout's missing a clusterdump utility that reads in LDA
> > topics, Document - DocumentId mapping and displays a report of the topics
> > and the documents that belong to a topic.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
