[ https://issues.apache.org/jira/browse/YARN-3194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14321213#comment-14321213 ]
Jian He commented on YARN-3194: ------------------------------- bq. RM(RMNodeImpl.AddNodeTransition#transition) is processing only RUNNING containers. COMPLETED containers are ignored. Completed containers are also processed. please refer to {{RMContainerImpl#ContainerRecoveredTransition}. Both running and completed containers sent by NM on re-registration will be processed by the new RM and routed back to the AM. > After NM restart,completed containers are not released which are sent during > NM registration > -------------------------------------------------------------------------------------------- > > Key: YARN-3194 > URL: https://issues.apache.org/jira/browse/YARN-3194 > Project: Hadoop YARN > Issue Type: Bug > Components: resourcemanager > Affects Versions: 2.6.0 > Environment: NM restart is enabled > Reporter: Rohith > Assignee: Rohith > Priority: Blocker > > On NM restart ,NM sends all the outstanding NMContainerStatus to RM. But RM > process only ContainerState.RUNNING. If container is completed when NM was > down then those containers resources wont be release which result in > applications to hang. -- This message was sent by Atlassian JIRA (v6.3.4#6332)