Thanks Srivastava for your reply. the size of the vocabulary is about 300-400K, and the topicNum is set to 100 or 150. it is OK?
On Fri, Aug 22, 2014 at 7:00 PM, vaibhav srivastava <[email protected]> wrote: > What is your dictionary size. Lot of things depend on it. When we calculate > final probability > On 22 Aug 2014 14:27, "Wei Li" <[email protected]> wrote: > > > Hi All: > > > > I have successfully compiled the Mahout 0.9 on Hadoop and submit the > > LDA CVB model, most of the parameters are set to default values and the > > --maxIter is set to 25. After we got the model, we found that the word > > probability in each topic is quite small, most of them are about 0.00001 > > equally, is it OK? can we change the alpha and beta to change the word > > distribution in each topic? thanks all :) > > >
