[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe updated MAPREDUCE-5848:
----------------------------------

     Target Version/s: 2.5.0
    Affects Version/s: 2.1.0-beta

Moving this to MAPREDUCE since it's an issue with the MapReduce AM and not YARN.

Note that the small patch may not be sufficient to completely solve the issue 
of preempted tasks being interpreted as FAILED rather than KILLED.  Due to 
races between the AM heartbeat to the RM and the AM receiving task status 
updates via the task umbilical, the AM may still think that a task attempt 
failed even though it was preempted.  When a task attempt receives SIGTERM from 
the NM, the shutdown hook closes the FileSystem, which often causes exceptions 
within the task.  Those exceptions are reported as a task failure via the task 
umbilical.  If that status arrives at the AM before the AM receives the 
preempted container status from the RM, then the AM counts it as a task failure 
rather than a preemption.
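
To make the ordering problem concrete, here is a minimal toy model of the race.
It is only a sketch and not the actual AM code: the class, the method names, and
the container ID are all hypothetical, and the real AM state machine is far more
involved.  It assumes the AM can classify an umbilical error as KILLED only if
the RM has already reported that container as preempted.

```java
import java.util.HashSet;
import java.util.Set;

// Toy model of the race described above. NOT the real MapReduce AM;
// every name here is hypothetical and exists only for illustration.
public class PreemptionRaceSketch {
    public enum Outcome { FAILED, KILLED }

    // Containers the RM has reported as preempted so far.
    private final Set<String> preempted = new HashSet<>();
    // Attempts counted against the maximum number of failures.
    private int failureCount = 0;

    // Models the AM learning of a preemption via its heartbeat to the RM.
    public void onRmReportsPreempted(String containerId) {
        preempted.add(containerId);
    }

    // Models an error reported by the task over the umbilical, for example
    // an exception caused by the FileSystem closing after SIGTERM.
    public Outcome onUmbilicalError(String containerId) {
        if (preempted.contains(containerId)) {
            return Outcome.KILLED;   // preemption already known: not a failure
        }
        failureCount++;              // preemption not yet known: counted as failure
        return Outcome.FAILED;
    }

    public int failureCount() {
        return failureCount;
    }

    public static void main(String[] args) {
        // Ordering 1: the umbilical error arrives before the RM status.
        PreemptionRaceSketch umbilicalFirst = new PreemptionRaceSketch();
        Outcome a = umbilicalFirst.onUmbilicalError("container_1");
        umbilicalFirst.onRmReportsPreempted("container_1"); // arrives too late
        System.out.println("umbilical first: " + a
                + ", failures=" + umbilicalFirst.failureCount());

        // Ordering 2: the RM status arrives before the umbilical error.
        PreemptionRaceSketch heartbeatFirst = new PreemptionRaceSketch();
        heartbeatFirst.onRmReportsPreempted("container_1");
        Outcome b = heartbeatFirst.onUmbilicalError("container_1");
        System.out.println("heartbeat first: " + b
                + ", failures=" + heartbeatFirst.failureCount());
    }
}
```

In this model the same preempted container is counted as FAILED in the first
ordering and as KILLED in the second, which is why a fix that only looks at the
container status from the RM may not cover every case.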

> MapReduce counts forcibly preempted containers as FAILED
> --------------------------------------------------------
>
>                 Key: MAPREDUCE-5848
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5848
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 2.1.0-beta
>            Reporter: Carlo Curino
>            Assignee: Subramaniam Krishnan
>         Attachments: YARN-1958.patch
>
>
> The MapReduce AM is considering a forcibly preempted container as FAILED, 
> while I think it should be considered as KILLED (i.e., not count against the 
> maximum number of failures). 



--
This message was sent by Atlassian JIRA
(v6.2#6252)
