Rephrasing again...

I would recommend that you work off of Mahout 0.7 or the trunk (and not 
mahout-0.6). 



________________________________
 From: Suneel Marthi <[email protected]>
To: "[email protected]" <[email protected]> 
Sent: Friday, May 24, 2013 8:02 AM
Subject: Re: running lda in mahout
 


I would suggest that you work off of Mahout 0.7 or the trunk (and not 
mahout-0.6). 
How many data points do you have in those 40k files? 
Depending on how many points you are trying to cluster, and given your setup, 
long execution times are possible.
 




________________________________
 From: parnab kumar <[email protected]>
To: [email protected] 
Sent: Friday, May 24, 2013 5:59 AM
Subject: running lda in mahout
 

Hi,
I am using Mahout 0.6 with its default settings, i.e. I am not using a
Hadoop cluster; I am running it in developer mode. To test LDA, I used
around 40k files, which I converted to sequence-file format vectors. I
tried a test with 20 iterations, and it is taking more than 3 hours to
complete them. I do not understand why it is taking so much time. Is it
normal for it to take this long without a cluster? I am using a CPU with
2 processors and 2 GB of RAM.

Thanks,
Parnab
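
As a rough sanity check on the figures quoted in the question, the run time can be broken down per iteration and per document. This is only back-of-the-envelope arithmetic on the numbers in the message (3 hours, 20 iterations, about 40k files); the variable names are illustrative, and since the run took "more than" 3 hours, the results are lower bounds rather than measurements.

```python
# Figures taken from the message; "more than 3 hrs" makes these lower bounds.
total_seconds = 3 * 60 * 60   # 3 hours
iterations = 20
documents = 40_000            # "around 40k files"

# Average wall-clock time spent on each LDA iteration.
seconds_per_iteration = total_seconds / iterations

# Average time per document within a single iteration, in milliseconds.
ms_per_doc_per_iteration = seconds_per_iteration / documents * 1000

print(f"{seconds_per_iteration / 60:.1f} min per iteration")
print(f"{ms_per_doc_per_iteration:.1f} ms per document per iteration")
```

Whether that per-document cost is reasonable depends on document length, vocabulary size, and the number of topics, none of which are stated in the thread.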
