[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12927018#action_12927018
 ] 

Owen O'Malley commented on MAPREDUCE-2168:
------------------------------------------

There is already a limit in the code, so it shouldn't be doing that unless you 
increased the limit. 

Also be aware that the shuffle was completely re-written a year ago, so the 
version in 0.21 and trunk is very different from the version you are running.

> We should  implement limits on shuffle connections to TaskTracker per job
> -------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-2168
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2168
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>            Reporter: Liyin Liang
>
> As trailing map tasks will be attacked by all reduces simultaneously, all the 
> worker threads that for the http server of a TaskTracker may be occupied  by 
> one job's reduce tasks to fetch map outputs. Then this tasktracker's iowait 
> and load will be very high (100+ in our cluster, we set 
> tasktracker.http.threads with 100). What's more, other job's reduces have to 
> wait some time (may be several minutes) to connect to the TaskTracker to 
> fetch there map's outputs.
> So I think we should implement limits on shuffle connections:
> 1. limit the worker threads' number maybe percent  occupied  the same job's 
> reduces ;
> 2. limit the worker threads' number serving the same map output 
> simultaneously.
> Thoughts? 
> ps: we are using hadoop 0.19.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to