On Wed, Nov 30, 2011 at 1:03 PM, Isabel Drost <[email protected]> wrote:
> On 28.11.2011 bish maten wrote:
> > mahout ldatopics -i mahout-work/abc/abc-lda/state-20 -d
> > mahout-work/abc/abc-out-seqdir-sparse-lda/dictionary.file-0 -dt
> > sequencefile (there were no errors reported and command worked fine with
> > following output). Does the output appear ok?
>
> Hmm - this only prints the resulting LDA topics - which command did you use to
> generate them?
>
> Please also note that Jake is currently working on improving our LDA support, if
> you are interested in that algorithm it might be interesting for you to look
> into his patch in https://issues.apache.org/jira/browse/MAHOUT-897

Yeah, I'm also working on moving away from LDATopic altogether, instead using
VectorDumper + dictionary file and grabbing top N weighted elements in the
vector representing the topic. We already do this internally at Twitter, I just
have to get that particular patch formatted properly and cleaned up once
MAHOUT-897 gets committed (which will hopefully be this week).

-jake
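
For anyone following along, here is a rough sketch of the "top N weighted elements" idea. It is plain Python, not Mahout code and not the pending patch. It assumes a topic is available as a sparse mapping from term index to weight, and the dictionary as a mapping from term index to term string. The function name top_terms and the toy data are made up for illustration.

```python
import heapq

def top_terms(topic_vector, dictionary, n=10):
    """Return the n highest-weighted (term, weight) pairs of one topic.

    topic_vector: dict mapping term index -> weight (a sparse vector)
    dictionary:   dict mapping term index -> term string
    """
    # nlargest avoids sorting the whole vocabulary when n is small
    best = heapq.nlargest(n, topic_vector.items(), key=lambda kv: kv[1])
    # Fall back to the raw index if the dictionary has no entry for it
    return [(dictionary.get(idx, str(idx)), weight) for idx, weight in best]

# Hypothetical toy data, only to show the shape of the inputs
dictionary = {0: "hadoop", 1: "topic", 2: "vector", 3: "cluster"}
topic = {0: 0.05, 1: 0.40, 2: 0.30, 3: 0.25}
print(top_terms(topic, dictionary, n=2))  # [('topic', 0.4), ('vector', 0.3)]
```

A heap-based selection is used here because N is usually tiny compared to the vocabulary size; a full sort would also work.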
