Hello,

If you are using Mahout 0.5, consider moving to 0.6 to get around this issue. I'm not sure whether a patch already exists, but the problem seems to be in LDAPrintTopics (cf. http://osdir.com/ml/general/2011-11/msg14635.html).
Best,
--Omar

On Fri, Jun 29, 2012 at 7:57 AM, S.Sudarshan <[email protected]> wrote:
> Hello,
>
> I have been following the Mahout-In-Action book to learn mahout. Its a
> great book. I am at the section where I am trying to run the LDA algorithm
> to the reuters data. However, regardless of the number of times I run it,
> I only get one topic (Topic-0) when I run LDAPrintTopics on the state-20. I
> ran the command as indicated:
>
> mahout lda -i reuters-vectors/tf-vectors -o reuters-lda-sparse -k 10
> -v 34262 -x 20 -ow
>
> Topic 0
> ===========
> billion [p(billion|topic_0) = 0.04580929884162013
> pct [p(pct|topic_0) = 0.043323700764985575
> dlrs [p(dlrs|topic_0) = 0.031395871939373196
> 3 [p(3|topic_0) = 0.027311386657272094
> 1987 [p(1987|topic_0) = 0.025690077982656934
> 1 [p(1|topic_0) = 0.022727304049111215
> reuter [p(reuter|topic_0) = 0.019572283708227903
> mln [p(mln|topic_0) = 0.014569551610736616
> april [p(april|topic_0) = 0.014453636611524965
> march [p(march|topic_0) = 0.014359948846622552
>
> Could someone help me with this ?
>
>
> Thanks.

--
Omar U. Florez
Doctoral student in Computer Science
Utah State University
Cell: +1 (435) 757-9354
Web: http://omarflorez.info
