[ 
https://issues.apache.org/jira/browse/YARN-4528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15081342#comment-15081342
 ] 

sandflee commented on YARN-4528:
--------------------------------

[~jianhe] reviewing the code of how containers complete msg passed from RM to 
AM, seems there is a race condition that the message will be lost when msg 
pulled by AM (not really passed to AM) and AM crashed. we could fix this by put 
finishedContainersSentToAM to justFinishContainers when transfer state from 
previous RMAppAttempt. 

> decreaseContainer Message maybe lost if NM restart
> --------------------------------------------------
>
>                 Key: YARN-4528
>                 URL: https://issues.apache.org/jira/browse/YARN-4528
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: sandflee
>         Attachments: YARN-4528.01.patch
>
>
> we may pending the container decrease msg util next heartbeat. or checks the 
> resource with rmContainer when node register.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to