dzcxzl created YARN-10462:
-----------------------------
Summary: Configurable shutdown cleanup slop
Key: YARN-10462
URL: https://issues.apache.org/jira/browse/YARN-10462
Project: Hadoop YARN
Issue Type: Improvement
Components: nodemanager
Affects Versions: 3.1.0
Reporter: dzcxzl
When stopping NM or decommission NM, stopping all containers, the waiting time
is composed of three values
sleep-delay-before-sigkill+process-kill-wait+SHUTDOWN_CLEANUP_SLOP_MS (constant
1000)
yarn.nodemanager.sleep-delay-before-sigkill.ms=250
yarn.nodemanager.process-kill-wait.ms=5000
SHUTDOWN_CLEANUP_SLOP_MS=1000
The parameters of sleep-delay-before-sigkill and process-kill-wait are the time
to kill a container/process. When there are too many container lists to be
killed, it is usually not completely killed.
We can make SHUTDOWN_CLEANUP_SLOP_MS a configurable parameter, so that in some
scenarios, we can wait as long as possible to kill all containers to complete.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]