[ 
https://issues.apache.org/jira/browse/YARN-4277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14984585#comment-14984585
 ] 

sandflee commented on YARN-4277:
--------------------------------

Is there any plan to store NM info?  [~jlowe] [~djp] [~jianhe], we could store 
just the NM info, not the containers running on the NM.
Without NM info:
1, containers could be leaked, as in this issue.
2, the AM learns nothing if an NM crashes permanently and the RM then fails over.
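
To make the proposal concrete, here is a rough toy model of the scenario, not real 
YARN code. ToyRM, its "store" dict, the "EXPIRED" marker and the kill-on-register 
behaviour are all hypothetical names and assumptions for illustration only; the 
actual RM state store and registration path are not modelled.

```python
from dataclasses import dataclass, field


@dataclass
class ToyRM:
    """Toy stand-in for the RM; NOT the real YARN ResourceManager."""
    # Assumption: when True, NM status is written to a store that
    # survives failover. When False, nothing about NMs is persisted.
    persist_nm_info: bool
    store: dict = field(default_factory=dict)  # hypothetical: NM id -> status

    def expire_node(self, nm_id):
        """NM timed out; AMs have already been told its containers completed."""
        if self.persist_nm_info:
            self.store[nm_id] = "EXPIRED"

    def failover(self):
        """Return the new active RM, carrying over only the persisted store."""
        return ToyRM(self.persist_nm_info, dict(self.store))

    def register_node(self, nm_id, recovered_containers):
        """NM re-registers; return the containers the RM asks it to kill."""
        if self.store.get(nm_id) == "EXPIRED":
            return list(recovered_containers)
        # No record of the expiry, so the RM accepts the containers as-is.
        return []


def containers_left_running(persist):
    """Replay steps 1-3 of the issue and return the containers that survive."""
    rm = ToyRM(persist_nm_info=persist)
    rm.expire_node("nm1")          # 1, NM crashes and times out
    rm = rm.failover()             # 2, RM fails over
    recovered = ["c1", "c2"]       # 3, NM restarts with recovered containers
    killed = rm.register_node("nm1", recovered)
    return [c for c in recovered if c not in killed]
```

In this sketch, containers_left_running(False) returns both containers (the leak), 
while containers_left_running(True) returns an empty list, because the new RM still 
knows that the NM had expired.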

> containers would be leaked if nm crashed  and rm failover
> ---------------------------------------------------------
>
>                 Key: YARN-4277
>                 URL: https://issues.apache.org/jira/browse/YARN-4277
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: sandflee
>
> NM restart and RM HA are enabled.
> 1, The NM crashes; after the timeout, the RM sends a container-complete 
> message to the corresponding AM.
> 2, The RM fails over.
> 3, The NM restarts and registers with the RM, recovering the containers that 
> were running on it; these containers are leaked.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
