[ 
https://issues.apache.org/jira/browse/HADOOP-3297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12594808#action_12594808
 ] 

Mahadev konar commented on HADOOP-3297:
---------------------------------------

the patch looks good... the only concern I have is that if we want to check and 
see that it does not degrade perofrmace and lead to more problems at the 
jobtracker or otherwise.
We could try running sort with this patch or some map reduce job where we have 
a huge number of mappers say 100,000 and 500 reducers or something. 

This is just to verify that the task tracker being agressive in fetching the 
mapoutputs does not degrade performance. The maps are short lived, so its not 
hard to imagine a situation that all the reduces start bombarding the 
jobtracker with requests for maps at the same time asking for more. We should 
check to see if the jobtracker can handle the load and the performance in such 
a situation does not degrade. 

> The way in which ReduceTask/TaskTracker gets completion events during shuffle 
> can be improved
> ---------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-3297
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3297
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>            Assignee: Devaraj Das
>             Fix For: 0.18.0
>
>         Attachments: 3297.patch, 3297.patch
>
>
> Certain things like poll frequency, number of events fetched in one go, etc. 
> can probably be improved to improve the shuffle performance. This would 
> affect the task-->tasktracker and the tasktracker-->jobtracker shuffle 
> related RPCs.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to