[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12830406#action_12830406
 ] 

Todd Lipcon commented on MAPREDUCE-1463:
----------------------------------------

I think the transferred logic is wrong. Shouldn't it be:
{code}
return numMapTasks <= reduceRushMapsThreshold ||
            numReduceTasks <= reduceRushReducesThreshold ||
            finishedMapTasks >= completedMapsForReduceSlowstart;
{code}

Also, I'm not sure that the design is quite right. If I have 1 map but 200 
reduces, I don't want to rush the reduces, do I? That is to say, should the 
condition be && between the two rush parameters, or ||?

> Reducer should start faster for smaller jobs
> --------------------------------------------
>
>                 Key: MAPREDUCE-1463
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1463
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: contrib/fair-share
>            Reporter: Scott Chen
>            Assignee: Scott Chen
>         Attachments: MAPREDUCE-1463-v1.patch, MAPREDUCE-1463-v2.patch
>
>
> Our users often complain about the slowness of smaller ad-hoc jobs.
> The overhead to wait for the reducers to start in this case is significant.
> It will be good if we can start the reducer sooner in this case.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to