Thanks Billy,I am trying 'nice', and will report the result later.

On Tue, May 12, 2009 at 3:42 AM, Billy Pearson
<sa...@pearsonwholesale.com>wrote:

> Might try setting the tasktrackers linux nice level to say 5 or 10
> leavening dfs and hbase setting to 0
>
> Billy
> "zsongbo" <zson...@gmail.com> wrote in message
> news:fa03480d0905110549j7f09be13qd434ca41c9f84...@mail.gmail.com...
>
>  Hi all,
>> Now, if we have a large dataset to process by MapReduce. The MapReduce
>> will
>> take machine resources as many as possible.
>>
>> So when one such a big MapReduce job are running, the cluster would become
>> very busy and almost cannot do anything else.
>>
>> For example, we have a HDFS+MapReduc+HBase cluster.
>> There are a large dataset in HDFS to be processed by MapReduce
>> periodically,
>> the workload is CPU and I/O heavy. And the cluster also provide other
>> service for query (query HBase and read files in HDFS). So, when the job
>> is
>> running, the query latency will become very long.
>>
>> Since the MapReduce job is not time sensitive, I want to control the load
>> of
>> MapReduce. Do you have some advices ?
>>
>> Thanks in advance.
>> Schubert
>>
>>
>
>

Reply via email to