[ https://issues.apache.org/jira/browse/MAPREDUCE-3927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13289349#comment-13289349 ]
Bhallamudi Venkata Siva Kamesh commented on MAPREDUCE-3927: ----------------------------------------------------------- Thanks Deva and Jason for the reviews. @Jason I *think*, to get *true* progess the map/reduce phase,we should return *1f*, when a map/reduce task is finished (SUCCEEDED or FAILED or KILLED). However, in the current code base, progress of the failed/killed map/reduce task will be < *1f*. {code:title=Proposed Change: JobImpl#computeProgress|borderStyle=solid} if (task.getType() == TaskType.MAP) { mapProgress += (task.isFinished() ? 1f : task.getProgress()); } else { reduceProgress += (task.isFinished() ? 1f : task.getProgress()); } {code} > Shuffle hang when set map.failures.percent > ------------------------------------------ > > Key: MAPREDUCE-3927 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-3927 > Project: Hadoop Map/Reduce > Issue Type: Bug > Components: mrv2 > Affects Versions: 0.23.3, 2.0.0-alpha, 2.0.1-alpha, trunk > Reporter: MengWang > Assignee: Bhallamudi Venkata Siva Kamesh > Priority: Critical > Labels: patch > Attachments: MAPREDUCE-3927-1.patch, MAPREDUCE-3927-2.patch, > MAPREDUCE-3927.patch, MAPREDUCE-3927.patch > > > When set mapred.max.map.failures.percent and there does have some failed > maps, then shuffle will hang -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira