sandflee commented on YARN-4277:

yes, this's a problem in our cluster, our NM hangs for a long time because some 
thread we added not exit and we have not merge YARN-3585.  this happended  
weeks ago,  logs are lost.

> containers would be leaked if nm crashed  and rm failover
> ---------------------------------------------------------
>                 Key: YARN-4277
>                 URL: https://issues.apache.org/jira/browse/YARN-4277
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: sandflee
> nm restart and rm ha is enabled.
> 1,  nm crashed, after timeout, rm send container complete msg to 
> corresponding AM.
> 2, rm failovers
> 3, nm restart and register to RM , recovering containers running on NM, these 
> containers and leaked.

This message was sent by Atlassian JIRA

Reply via email to