Rephrasing again... I would recommend that you work off of Mahout 0.7 or the trunk (and not Mahout 0.6).
________________________________
From: Suneel Marthi <[email protected]>
To: "[email protected]" <[email protected]>
Sent: Friday, May 24, 2013 8:02 AM
Subject: Re: running lda in mahout

I would suggest that you work off of Mahout 0.7 or the trunk (and not Mahout 0.6).

How many data points do you have in those 40k files? Depending on how many points you are trying to cluster, and given your setup, the long execution times are possible.

________________________________
From: parnab kumar <[email protected]>
To: [email protected]
Sent: Friday, May 24, 2013 5:59 AM
Subject: running lda in mahout

Hi,

I am using Mahout 0.6 with its default settings, i.e. I am not using any Hadoop cluster; I am running it in developer mode. To test LDA, I used around 40k files, which I converted to sequence file format vectors. I tried a test with 20 iterations, and it is taking more than 3 hours to complete them.

I do not understand why it is taking so much time. Is it natural for it to take this long without a cluster? I am using a CPU with 2 processors and 2 GB of RAM.

Thanks,
Parnab
