[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13175872#comment-13175872
 ] 

Arun C Murthy commented on MAPREDUCE-3490:
------------------------------------------

Sharad, sorry, I should have explained better.

The problem computing success/failed tasks in the RMContainerAllocator is that 
they only know Container status (at best exit code) and not whether the task 
completed successfully or not. Thus, we'd have to duplicate logic to compute 
tasks' status from container statuses there, something I'd like to avoid. 

Thus, my proposal to rely on JobImpl (either via events or just a direct 
function call). Makes sense? 

Thoughts?
                
> RMContainerAllocator counts failed maps towards Reduce ramp up
> --------------------------------------------------------------
>
>                 Key: MAPREDUCE-3490
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3490
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mr-am, mrv2
>    Affects Versions: 0.23.0
>            Reporter: Siddharth Seth
>            Assignee: Arun C Murthy
>            Priority: Blocker
>         Attachments: MAPREDUCE-3490.patch, MAPREDUCE-3490.patch, 
> MAPREDUCE-3490.patch, MAPREDUCE-3490.patch, MR-3490-alternate.patch
>
>
> The RMContainerAllocator does not differentiate between failed and successful 
> maps while calculating whether reduce tasks are ready to launch. Failed tasks 
> are also counted towards total completed tasks. 
> Example. 4 failed maps, 10 total maps. Map%complete = 4/14 * 100 instead of 
> being 0.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to