dzcxzl created YARN-10462:
-----------------------------

             Summary: Configurable shutdown cleanup slop
                 Key: YARN-10462
                 URL: https://issues.apache.org/jira/browse/YARN-10462
             Project: Hadoop YARN
          Issue Type: Improvement
          Components: nodemanager
    Affects Versions: 3.1.0
            Reporter: dzcxzl


When stopping NM or decommission NM, stopping all containers, the waiting time 
is composed of three values 
sleep-delay-before-sigkill+process-kill-wait+SHUTDOWN_CLEANUP_SLOP_MS (constant 
1000)

yarn.nodemanager.sleep-delay-before-sigkill.ms=250
yarn.nodemanager.process-kill-wait.ms=5000
SHUTDOWN_CLEANUP_SLOP_MS=1000

The parameters of sleep-delay-before-sigkill and process-kill-wait are the time 
to kill a container/process. When there are too many container lists to be 
killed, it is usually not completely killed.

We can make SHUTDOWN_CLEANUP_SLOP_MS a configurable parameter, so that in some 
scenarios, we can wait as long as possible to kill all containers to complete.

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to