lachisis created YARN-4382:
------------------------------

             Summary: Container hierarchy in cgroup may remain for ever after the container have be terminated
                 Key: YARN-4382
                 URL: https://issues.apache.org/jira/browse/YARN-4382
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager
    Affects Versions: 2.5.2
            Reporter: lachisis

This problem can occur when LinuxContainerExecutor is used to execute containers.

In the common case, when a container runs, a corresponding hierarchy is created under the cgroup directory. When the container terminates, the hierarchy is deleted within a few seconds (this time can be configured with yarn.nodemanager.linux-container-executor.cgroups.delete-delay-ms).

In the code, I found that CgroupsLCEResource sends the signal to kill the container process asynchronously and, at the same time, tries to delete the container hierarchy for up to the configured "delete-delay-ms". However, if the container process takes longer than "delete-delay-ms" to be killed, the deletion attempts give up before the process has exited, and the container hierarchy remains forever.
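
The race described above can be illustrated with a small standalone sketch. This is not the NodeManager code: the class CgroupDeleteRace, the methods deleteWithin and simulate, and the timings are all hypothetical. A temporary directory stands in for the container hierarchy, and a file inside it stands in for a task that is still attached, so the directory cannot be removed until the "process" has exited.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

public class CgroupDeleteRace {

  // Hypothetical helper: retry removing the directory until it succeeds
  // or delayMs has elapsed. File.delete() fails on a non-empty directory,
  // which stands in for a cgroup that still has a task attached.
  public static boolean deleteWithin(Path dir, long delayMs, long pollMs)
      throws InterruptedException {
    long deadline = System.nanoTime() + delayMs * 1_000_000L;
    do {
      if (dir.toFile().delete()) {
        return true;
      }
      Thread.sleep(pollMs);
    } while (System.nanoTime() < deadline);
    return false; // gave up; nothing retries later, so the directory stays
  }

  // Hypothetical simulation of one container: the "process" exits
  // asynchronously after exitAfterMs, while deletion is attempted for
  // at most delayMs. Returns true if the hierarchy was removed in time.
  public static boolean simulate(long exitAfterMs, long delayMs)
      throws IOException, InterruptedException {
    Path dir = Files.createTempDirectory("container_");
    Path task = Files.createFile(dir.resolve("task"));

    Thread killer = new Thread(() -> {
      try {
        Thread.sleep(exitAfterMs);   // time the process needs to die
        Files.deleteIfExists(task);  // process gone, directory now empty
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
      } catch (IOException e) {
        throw new RuntimeException(e);
      }
    });
    killer.start();

    boolean removed = deleteWithin(dir, delayMs, 10);
    killer.join();
    Files.deleteIfExists(dir); // clean up the temp directory after the demo
    return removed;
  }

  public static void main(String[] args) throws Exception {
    System.out.println("fast exit, removed in time: " + simulate(50, 1000));
    System.out.println("slow exit, removed in time: " + simulate(500, 100));
  }
}
```

With these timings, the first call should report that the directory was removed and the second that it was left behind, which corresponds to the leaked hierarchy reported here. One possible direction, offered only as a suggestion, is to retry the deletion after the process is known to have exited rather than giving up after a fixed delay.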