[ 
https://issues.apache.org/jira/browse/YARN-4277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14964387#comment-14964387
 ] 

sandflee commented on YARN-4277:
--------------------------------

yes, this's a problem in our cluster, our NM hangs for a long time because some 
thread we added not exit and we have not merge YARN-3585.  this happended  
weeks ago,  logs are lost.



> containers would be leaked if nm crashed  and rm failover
> ---------------------------------------------------------
>
>                 Key: YARN-4277
>                 URL: https://issues.apache.org/jira/browse/YARN-4277
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: sandflee
>
> nm restart and rm ha is enabled.
> 1,  nm crashed, after timeout, rm send container complete msg to 
> corresponding AM.
> 2, rm failovers
> 3, nm restart and register to RM , recovering containers running on NM, these 
> containers and leaked.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to