Re: How to do load control of MapReduce

Billy Pearson Mon, 11 May 2009 12:43:12 -0700

Might try setting the tasktrackers linux nice level to say 5 or 10 leaveningdfs and hbase setting to 0


Billy

"zsongbo" <zson...@gmail.com> wrote in messagenews:fa03480d0905110549j7f09be13qd434ca41c9f84...@mail.gmail.com...

Hi all,
Now, if we have a large dataset to process by MapReduce. The MapReducewill
take machine resources as many as possible.
So when one such a big MapReduce job are running, the cluster would become
very busy and almost cannot do anything else.

For example, we have a HDFS+MapReduc+HBase cluster.
There are a large dataset in HDFS to be processed by MapReduceperiodically,
the workload is CPU and I/O heavy. And the cluster also provide other
service for query (query HBase and read files in HDFS). So, when the jobis
running, the query latency will become very long.
Since the MapReduce job is not time sensitive, I want to control the loadof
MapReduce. Do you have some advices ?

Thanks in advance.
Schubert

Re: How to do load control of MapReduce

Reply via email to