I installed mahout-0.6 from the trunk using SVN and built it completely (even all the tests). It took a while, but, now, everything seems fine. The vectorized output are good (tfidf-vectors) and I am able to run Kmeans clustering properly with no issues. Finally, able to run clusterdump as well.
Thanks to all for suggestions. /PD On Wed, Dec 21, 2011 at 6:31 PM, Ted Dunning <[email protected]> wrote: > Yes. Completely concur. All of the other steps look just fine. > > On Wed, Dec 21, 2011 at 12:56 PM, Periya.Data <[email protected]> > wrote: > > > Again, I am using Mahout -0.5, Hadoop 0.20.2-cdh3u2. All this is for > > tracking why my kmeans clustering is not working and giving the > > indexoutofboudsexception...and it looks like this tfidf-vector generation > > maybe the culprit.. > > >
