reduce task hanging or just slow?

Colin Freas Mon, 31 Mar 2008 12:08:18 -0700

I've set up a job to run on my small 4-node (sometimes 5-node) cluster of
dual-processor server boxes with 2-8GB of memory each.

My job processes 24 files of 100-300MB each, which together make up a day's
worth of logs; the total is about 6GB.

I've modified the word count example to do what I need, and it works fine on
small test files.

I've set the number of map tasks to 200 and the number of reduce tasks to 14.
Things seem to go along fine: the map % climbs nicely, along with the
reduce %.  Once the map hits 100%, though, the reduce % stops increasing.
Right now it's stuck around 58%.  I was hoping that changing the number of
reduce tasks would help, but I'm not really sure it did.  I had tried this
once before with the default number of reduce tasks, and I got to 100% (map)
and 14% (reduce) before I saw this hanging behavior.
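
For scale, here is a rough back-of-the-envelope sketch of the per-task data
volumes those settings imply.  It assumes the input is exactly 6 GiB, that it
divides evenly across tasks, and that the map output is about as large as the
input; none of that is measured, and the helper name per_task_mb is purely
illustrative.

```python
# Rough sizing for the job described above. File count, task counts and the
# ~6GB total come from the post; everything else is an assumption.
TOTAL_INPUT_MB = 6 * 1024   # "about 6GB", treated as exactly 6 GiB (assumption)
NUM_FILES = 24
NUM_MAP_TASKS = 200         # number of map tasks requested for the job
NUM_REDUCE_TASKS = 14       # number of reduce tasks requested for the job


def per_task_mb(total_mb, tasks):
    """Average share of the data per task, assuming a perfectly even split."""
    return total_mb / tasks


avg_file_mb = per_task_mb(TOTAL_INPUT_MB, NUM_FILES)
map_input_mb = per_task_mb(TOTAL_INPUT_MB, NUM_MAP_TASKS)
# Assumption: map output is roughly the same size as the map input.
reduce_input_mb = per_task_mb(TOTAL_INPUT_MB, NUM_REDUCE_TASKS)

print(f"average file size:      {avg_file_mb:.2f} MB")
print(f"input per map task:     {map_input_mb:.2f} MB")
print(f"input per reduce task:  {reduce_input_mb:.2f} MB")
```

If those assumptions hold, each reduce task has far more data to pull in and
merge than any single map task reads, so the reduce side may simply have a lot
of work left after the maps finish.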

I'm just trying to understand what's happening here, and whether there's
anything I can do to improve performance short of adding nodes.  Is it
likely I've set something up incorrectly somewhere?

Any help appreciated.

Thanks!

-Colin
