If you want k-means speed, see the new k-means code:
https://github.com/tdunning/knn

Can you describe your data a bit?

On Sat, Nov 10, 2012 at 11:22 AM, pricila rr <[email protected]> wrote:

> I am running the k-means algorithm.
> Does increasing the number of tasktrackers and datanodes increase the speed?
>
> Thank you
>
> 2012/11/10 Dmitriy Lyubimov <[email protected]>
>
> > I would imagine optimizing Mahout jobs is not fundamentally different from
> > optimizing any Hadoop job. Make sure you have an optimal number of tasks per
> > node configured, as well as an optimal amount of memory to prevent GC
> > thrashing. (Iterative Mahout batches tend to create GC churn at a somewhat
> > respectable rate.) When optimized correctly, individual Mahout tasks tend
> > to be CPU bound.
> >
> > Could you tell which Mahout method specifically you are talking about?
> >
> >
> > On Sat, Nov 10, 2012 at 11:11 AM, pricila rr <[email protected]> wrote:
> >
> > > Hello,
> > > How can I run Hadoop-Mahout jobs using the processors' full capacity?
> > > I have 10 slaves and 1 master, with i5 CPUs, but the Hadoop-Mahout jobs
> > > do not use all of this capacity.
> > >
> > > Thank you,
> > > Pricila
> > >
> >
>
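
The advice quoted above (tune the number of tasks per node and the memory per task) can be sketched as a small calculation. This is only a rough illustration: the function name `suggest_slots`, the 2 GB reserve for the OS and daemons, and the 25% reduce share are assumptions made for this example, not recommendations from Mahout or Hadoop. The three property names are the Hadoop 1.x (MRv1) tasktracker settings that were current when this thread was written.

```python
# Rough slot-sizing sketch for one Hadoop 1.x tasktracker node.
# The heuristic and its default numbers are assumptions for illustration only.

def suggest_slots(cores, ram_mb, reserved_mb=2048, reduce_fraction=0.25):
    """Map Hadoop 1.x property names to suggested values for one node."""
    if cores < 1:
        raise ValueError("cores must be >= 1")
    usable_mb = ram_mb - reserved_mb  # memory left after OS and daemons
    if usable_mb <= 0:
        raise ValueError("ram_mb must exceed reserved_mb")

    # Roughly one task slot per core, so CPU-bound tasks do not compete.
    reduce_slots = max(1, int(cores * reduce_fraction))
    map_slots = max(1, cores - reduce_slots)

    # Split usable memory evenly across all slots to limit GC pressure.
    heap_mb = usable_mb // (map_slots + reduce_slots)

    return {
        "mapred.tasktracker.map.tasks.maximum": str(map_slots),
        "mapred.tasktracker.reduce.tasks.maximum": str(reduce_slots),
        "mapred.child.java.opts": "-Xmx%dm" % heap_mb,
    }


if __name__ == "__main__":
    # Hypothetical node: 4 cores, 8 GB RAM (not taken from the thread).
    for name, value in sorted(suggest_slots(4, 8192).items()):
        print("%s = %s" % (name, value))
```

The resulting values would go into mapred-site.xml on each slave. Treat them as a starting point and check CPU utilisation and GC logs before settling on final numbers.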
