On Thu, 3 Sep 2009 13:20:16 -0700 (PDT)
ll_oz_ll <[email protected]> wrote:

> 
> Hi,
> Is hadoop able to take into account multi core nodes, so that nodes
> which have multiple cores run multiple concurrent jobs ?
> Or does that need to be configured manually and if so can that be
> configured individually for each node ?

Yes, it has to be configured manually. You set the following two
configuration variables in hadoop-site.xml on each node depending on the
number of cores on the node:

mapred.tasktracker.map.tasks.maximum
mapred.tasktracker.reduce.tasks.maximum

According to the book "Hadoop - the definitive guide", a good rule of
thumb is to have between 1 and 2 tasks per processor, counting both map
and reduce tasks. So, for example, if a machine has 8 cores, setting
mapred.tasktracker.map.tasks.maximum = 8 and
mapred.tasktracker.reduce.tasks.maximum = 8 probably makes sense, but
this also depends a bit on your load.

Cheers,
\EF
-- 
Erik Forsberg <[email protected]>
Developer, Opera Software - http://www.opera.com/

Reply via email to