[ 
https://issues.apache.org/jira/browse/YARN-9345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

liyakun updated YARN-9345:
--------------------------
    Issue Type: Improvement  (was: New Feature)

> NM actively does not accept new containers in the heartbeat
> -----------------------------------------------------------
>
>                 Key: YARN-9345
>                 URL: https://issues.apache.org/jira/browse/YARN-9345
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: nodemanager
>            Reporter: liyakun
>            Assignee: liyakun
>            Priority: Major
>
> At present, the NM has only one health check mechanism. If the node enters an 
> unhealthy state, all the containers running on it are killed.
>  However, an unhealthy node can be in one of two conditions: long-term 
> unavailability (what the current health mechanism handles) or only 
> temporary pressure.
>  Under temporary pressure (for example, a short-lived load spike), the node 
> only needs to wait a while before it returns to normal.
>  To support this, we need to extend the health check with a state in which 
> the node temporarily stops accepting new containers but does not kill the 
> containers that are already running.
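
The proposed extension can be illustrated with a small sketch. This is not code from YARN or from any patch on this issue: the class `NodeHealthSketch`, the enum `HealthState`, the `BUSY` state name, and the load threshold are all hypothetical names chosen for illustration. It only shows the intended semantics, assuming a third state sits between healthy and unhealthy.

```java
public class NodeHealthSketch {

    /** Hypothetical three-way health state (assumption, not a YARN type). */
    enum HealthState { HEALTHY, BUSY, UNHEALTHY }

    /** Only a fully healthy node should be offered new containers. */
    static boolean acceptsNewContainers(HealthState state) {
        return state == HealthState.HEALTHY;
    }

    /** Running containers survive unless the node is unhealthy. */
    static boolean keepsRunningContainers(HealthState state) {
        return state != HealthState.UNHEALTHY;
    }

    /**
     * Hypothetical classification: a failed health check means UNHEALTHY;
     * otherwise a load above the threshold means temporary pressure (BUSY).
     */
    static HealthState classify(boolean healthCheckOk,
                                double loadPerCore,
                                double busyThreshold) {
        if (!healthCheckOk) {
            return HealthState.UNHEALTHY;
        }
        return loadPerCore > busyThreshold ? HealthState.BUSY : HealthState.HEALTHY;
    }

    public static void main(String[] args) {
        // Print the policy for each state.
        for (HealthState state : HealthState.values()) {
            System.out.println(state
                + " acceptsNew=" + acceptsNewContainers(state)
                + " keepsRunning=" + keepsRunningContainers(state));
        }
    }
}
```

In this sketch the NM would report `BUSY` in its heartbeat while under temporary pressure, and return to `HEALTHY` once the load drops below the threshold, without any running container being killed in between.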



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)