Jian He commented on YARN-3987:

bq. leaving too many completed container(AM container) in NM. 
At a single point of time,there should be only one AM instance in NM. Do you 
mean the old AM containers are not cleaned up ?

If AM cannot be launched, the AM will expire in 10 mins, in which case the 
containers should also be cleanedup.

> am container complete msg ack to NM once RM receive it
> ------------------------------------------------------
>                 Key: YARN-3987
>                 URL: https://issues.apache.org/jira/browse/YARN-3987
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>            Reporter: sandflee
>            Assignee: sandflee
>         Attachments: YARN-3987.001.patch, YARN-3987.002.patch
> In our cluster we set max-am-attempts to a very very large num, and 
> unfortunately our am crash after launched, leaving too many completed 
> container(AM container) in NM.  completed container is removed from NM and 
> NMStateStore only if container complete is passed to AM, but if AM couldn't 
> be launched, the completed AM container couldn't be cleaned, and may eat up  
> NM heap memory.

This message was sent by Atlassian JIRA

Reply via email to