sandflee created YARN-3987:

             Summary: am container complete msg ack to NM once RM receive it
                 Key: YARN-3987
             Project: Hadoop YARN
          Issue Type: Bug
          Components: resourcemanager
            Reporter: sandflee

In our cluster we set max-am-attempts to a very very large num, and 
unfortunately our am crash after launched, leaving too many completed 
container(AM container) in NM.  completed container is removed from NM and 
NMStateStore only if container complete is passed to AM, but if AM couldn't be 
launched, the completed AM container couldn't be cleaned, and may eat up  NM 
heap memory.

This message was sent by Atlassian JIRA

Reply via email to