[ 
https://issues.apache.org/jira/browse/YARN-3194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14321213#comment-14321213
 ] 

Jian He commented on YARN-3194:
-------------------------------

bq. RM(RMNodeImpl.AddNodeTransition#transition) is processing only RUNNING 
containers. COMPLETED containers are ignored.
Completed containers are also processed; please refer to 
{{RMContainerImpl#ContainerRecoveredTransition}}.
Both running and completed containers sent by NM on re-registration will be 
processed by the new RM and routed back to the AM.
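
As a rough illustration of that routing (a toy model only, not the actual RM code; {{RecoverySketch}}, {{Status}}, {{State}} and the two lists are hypothetical names made up for this sketch), every status reported on re-registration is handled, and completed ones are queued for the AM instead of being dropped:

```java
import java.util.ArrayList;
import java.util.List;

public class RecoverySketch {
    // Hypothetical stand-in for the container states an NM can report.
    enum State { RUNNING, COMPLETE }

    // Hypothetical stand-in for one container status sent on re-registration.
    static final class Status {
        final String containerId;
        final State state;

        Status(String containerId, State state) {
            this.containerId = containerId;
            this.state = state;
        }
    }

    // Containers the toy RM keeps tracking as live.
    final List<String> running = new ArrayList<>();
    // Completed containers queued to be reported back to the AM.
    final List<String> finishedForAm = new ArrayList<>();

    // Handle every reported status; no state is silently ignored.
    void recover(List<Status> reported) {
        for (Status s : reported) {
            if (s.state == State.RUNNING) {
                running.add(s.containerId);
            } else {
                finishedForAm.add(s.containerId);
            }
        }
    }

    public static void main(String[] args) {
        RecoverySketch rm = new RecoverySketch();
        rm.recover(List.of(
                new Status("c1", State.RUNNING),
                new Status("c2", State.COMPLETE)));
        System.out.println("running=" + rm.running
                + " finishedForAm=" + rm.finishedForAm);
    }
}
```

In this sketch the completed container ends up in the list destined for the AM, which is the behaviour the comment above describes for the real recovery transition.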

> After NM restart,completed containers are not released which are sent during 
> NM registration
> --------------------------------------------------------------------------------------------
>
>                 Key: YARN-3194
>                 URL: https://issues.apache.org/jira/browse/YARN-3194
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.6.0
>         Environment: NM restart is enabled
>            Reporter: Rohith
>            Assignee: Rohith
>            Priority: Blocker
>
> On NM restart, the NM sends all outstanding NMContainerStatus reports to the RM. 
> But the RM processes only ContainerState.RUNNING. If a container completed while 
> the NM was down, that container's resources won't be released, which causes 
> applications to hang.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
