[
https://issues.apache.org/jira/browse/YARN-4277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14964387#comment-14964387
]
sandflee commented on YARN-4277:
--------------------------------
yes, this's a problem in our cluster, our NM hangs for a long time because some
thread we added not exit and we have not merge YARN-3585. this happended
weeks ago, logs are lost.
> containers would be leaked if nm crashed and rm failover
> ---------------------------------------------------------
>
> Key: YARN-4277
> URL: https://issues.apache.org/jira/browse/YARN-4277
> Project: Hadoop YARN
> Issue Type: Bug
> Reporter: sandflee
>
> nm restart and rm ha is enabled.
> 1, nm crashed, after timeout, rm send container complete msg to
> corresponding AM.
> 2, rm failovers
> 3, nm restart and register to RM , recovering containers running on NM, these
> containers and leaked.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)