[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13269155#comment-13269155
 ] 

alex gemini commented on MAPREDUCE-4228:
----------------------------------------

It's a little confusing,if mapreduce.job.reduce.slowstart.completedmaps didn't 
control when reduce will start scheduling,should we just remove this parameter? 
or maybe change name to mapreduce.job.reduce.slowstart.startedmaps?
                
> mapreduce.job.reduce.slowstart.completedmaps is not working properly to delay 
> the scheduling of the reduce tasks
> ----------------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4228
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4228
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: applicationmaster, mrv2
>    Affects Versions: 0.23.1
>            Reporter: Jason Lowe
>            Assignee: Jason Lowe
>
> If no more map tasks need to be scheduled but not all have completed, the 
> ApplicationMaster will start scheduling reducers even if the number of 
> completed maps has not met the mapreduce.job.reduce.slowstart.completedmaps 
> threshold.  For example, if the property is set to 1.0 all maps should 
> complete before any reducers are scheduled.  However the reducers are 
> scheduled as soon as the last map task is assigned to a container.  For a job 
> with very long-running maps, a cluster with enough capacity to launch all map 
> tasks could cause reducers to launch prematurely and waste cluster resources.
> Thanks to Phil Su for discovering this issue.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to