liyakun created YARN-9345:
-----------------------------

             Summary: NM actively does not accept new containers in the 
heartbeat
                 Key: YARN-9345
                 URL: https://issues.apache.org/jira/browse/YARN-9345
             Project: Hadoop YARN
          Issue Type: New Feature
          Components: nodemanager
            Reporter: liyakun
            Assignee: liyakun


At present, the NM has only one health check mechanism. If the node enters an 
unhealthy state, all containers running on it are killed.
However, an unhealthy node can be in one of two conditions: long-term 
unavailability (which the current health mechanism covers) or temporary 
pressure.
Under temporary pressure (such as a brief spike in load), the node only needs 
to wait a while before it returns to normal.
To support this, we need to extend the health check with a state in which the 
node temporarily does not accept new containers.
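
As a rough illustration of the proposal, the sketch below models a three-way 
node state in plain Java. It is not taken from the YARN code base: the names 
NodeAvailability, BUSY_LOAD_THRESHOLD, classify, acceptsNewContainers and 
keepsRunningContainers are all hypothetical, and using a load figure as the 
signal for temporary pressure is an assumption made only for this example.

```java
class NodeHealthSketch {

    /** Hypothetical three-way state; not an existing YARN enum. */
    enum NodeAvailability { HEALTHY, TEMPORARILY_BUSY, UNHEALTHY }

    /** Hypothetical threshold: normalized load above this counts as temporary pressure. */
    static final double BUSY_LOAD_THRESHOLD = 0.9;

    /**
     * Classifies the node. A failed health check means long-term unavailability;
     * a passing check with high load means temporary pressure.
     */
    static NodeAvailability classify(boolean healthCheckPassed, double normalizedLoad) {
        if (!healthCheckPassed) {
            return NodeAvailability.UNHEALTHY;
        }
        if (normalizedLoad > BUSY_LOAD_THRESHOLD) {
            return NodeAvailability.TEMPORARILY_BUSY;
        }
        return NodeAvailability.HEALTHY;
    }

    /** Only a fully healthy node would advertise itself as open to new containers. */
    static boolean acceptsNewContainers(NodeAvailability state) {
        return state == NodeAvailability.HEALTHY;
    }

    /** Running containers are killed only in the long-term unhealthy case. */
    static boolean keepsRunningContainers(NodeAvailability state) {
        return state != NodeAvailability.UNHEALTHY;
    }

    public static void main(String[] args) {
        // Example: the health check passes but the node is under heavy load.
        NodeAvailability state = classify(true, 0.95);
        System.out.println(state
            + " acceptsNew=" + acceptsNewContainers(state)
            + " keepsRunning=" + keepsRunningContainers(state));
    }
}
```

The point of the extra state is that a node under temporary pressure keeps its 
running containers and only stops taking new ones, while the existing 
unhealthy state keeps its current behaviour.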



