[ https://issues.apache.org/jira/browse/MAHOUT-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13942458#comment-13942458 ]
Suneel Marthi commented on MAHOUT-1470: --------------------------------------- Mahout's already get a LDAPrintTopics which prints the top K terms per topic. So this would basically transform the output of LdaTopics to replace topicId => topic and documentID => document ?? > Topic dump > ---------- > > Key: MAHOUT-1470 > URL: https://issues.apache.org/jira/browse/MAHOUT-1470 > Project: Mahout > Issue Type: New Feature > Components: Classification > Affects Versions: 1.0 > Reporter: Andrew Musselman > Priority: Minor > Fix For: 0.9 > > > Per > http://mail-archives.apache.org/mod_mbox/mahout-user/201403.mbox/%3CCAMc_qaL2DCgbVbam2miNsLpa4qvaA9sMy1-arccF9Nz6ApcsvQ%40mail.gmail.com%3E > > The script needs to be corrected to not call vectordump for LDA as > > vectordump utility (or even clusterdump) are presently not capable of > > displaying topics and relevant documents. I recall this issue was > > previously reported by Peyman Faratin post 0.9 release. > > > > Mahout's missing a clusterdump utility that reads in LDA > > topics, Document - DocumentId mapping and displays a report of the topics > > and the documents that belong to a topic. -- This message was sent by Atlassian JIRA (v6.2#6252)