It seems likely that you are only running one (single-threaded) map or reduce 
operation per worker node. Do you know whether you are in fact running multiple 
operations?

This also sounds like it may be a manifestation of a question that I have seen 
a lot on the mailing lists lately, which is that people do not know how to 
increase the number of task slots in their tasktracker configuration.  This 
setting is normally controlled via the setting 
mapred.tasktracker.{map|reduce}.tasks.maximum in mapred-site.xml.  The default 
of 2 each is probably too low for your servers.


----- Original Message -----
From: Ratner, Alan S (IS) <[email protected]>
To: [email protected] <[email protected]>
Sent: Wed Sep 15 09:47:47 2010
Subject: Making optimum use of cores

I'm running Hadoop 0.20.2 on a cluster of servers running Ubuntu 10.4.
Each server has 2 quad-core Nehalem CPUs for a total of 8 physical cores
running as 16 virtual cores.  Ubuntu's System Monitor displays 16
squiggly lines showing usage of the 16 virtual cores.  We only seem to
be making use of one of the 16 virtual cores on any slave node and even
on the master node only one virtual core is significantly busy at a
time.  Is there a way to make better use of the cores?  Presumably I
could run Hadoop in a VM assigned to each virtual core but I would think
there must be a more elegant solution.

Alan Ratner

Reply via email to