I've been running a program to count search terms in log files, which is basically a small modification of the wordcount program. The job doesn't do any real reduce work, so the only thing the reduce tasks have to do is sort the output files of the map tasks.
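In case it's useful, the map side is essentially wordcount with a filter. A rough sketch (the class name and the term-matching logic below are simplified stand-ins, not the actual code):

    import java.io.IOException;
    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.LongWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapred.MapReduceBase;
    import org.apache.hadoop.mapred.Mapper;
    import org.apache.hadoop.mapred.OutputCollector;
    import org.apache.hadoop.mapred.Reporter;

    // Essentially the wordcount mapper, except only tokens that look like
    // search terms get emitted.
    public class TermCountMapper extends MapReduceBase
        implements Mapper<LongWritable, Text, Text, IntWritable> {

      private static final IntWritable ONE = new IntWritable(1);
      private final Text term = new Text();

      public void map(LongWritable offset, Text logLine,
                      OutputCollector<Text, IntWritable> output, Reporter reporter)
          throws IOException {
        for (String token : logLine.toString().split("\\s+")) {
          if (isSearchTerm(token)) {        // placeholder for the real matching logic
            term.set(token);
            output.collect(term, ONE);      // emit (term, 1), just like wordcount
          }
        }
      }

      private boolean isSearchTerm(String token) {
        // Stand-in predicate; the real job matches against the query field of the log line.
        return token.startsWith("q=");
      }
    }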

My cluster has 4 machines, so based on the recommendations on the wiki, I set the reduce count to 8. Unfortunately, the performance was less than ideal. Specifically, after the map tasks had finished, I had to wait an additional 40% of the total job time just for copying and sorting the files. I know for a fact that the sort is very fast, so the only remaining question is why moving the files around takes so long.
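The reduce count is set through the normal job configuration. Roughly (the driver is simplified, input/output paths are omitted, and the class names are the same stand-ins as above):

    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapred.JobClient;
    import org.apache.hadoop.mapred.JobConf;

    public class TermCountDriver {
      public static void main(String[] args) throws Exception {
        JobConf conf = new JobConf(TermCountDriver.class);
        conf.setJobName("term-count");
        conf.setMapperClass(TermCountMapper.class);
        conf.setOutputKeyClass(Text.class);
        conf.setOutputValueClass(IntWritable.class);

        // 4 machines x 2, per the wiki's recommendation -- this is the slow case.
        // Dropping it to 4 (one per machine) is the fast case described below;
        // setting mapred.reduce.tasks in the config does the same thing.
        conf.setNumReduceTasks(8);

        // No reducer class is set, so the reduce tasks just write out the
        // sorted map output (identity reduce).

        // Input/output paths and formats omitted here for brevity.
        JobClient.runJob(conf);
      }
    }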

Looking at the jobtracker webapp, I noticed that the reduce copy phase listed under the job showed a transfer speed of 0.01 MB/s, which is fairly slow. The machines are connected to a gigabit switch, and uploading 5GB of files to HDFS (hadoop dfs -copyFromLocal) only takes about a minute.
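For comparison, 5GB in about a minute works out to roughly 85 MB/s, so the shuffle copies are running three to four orders of magnitude slower than what the same network and disks manage for a straight HDFS upload.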

Finally, when I set the reduce count to the number of machines, performance is good: a reduce task can start up on each machine right away, the slow transfers happen throughout the map phase, and the data is ready almost immediately once the maps finish.

If anyone has suggestions on how I might be able to increase performance, or on what might be going on in this scenario, I would appreciate the tips. I'd be happy to provide more details about the setup if needed; for the moment, it's more of a testing ground to see what my options are.

Thanks.

Ross Boucher
[EMAIL PROTECTED]
