lachisis created YARN-4382:
------------------------------

             Summary: Container hierarchy in cgroup may remain forever after 
the container has been terminated
                 Key: YARN-4382
                 URL: https://issues.apache.org/jira/browse/YARN-4382
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager
    Affects Versions: 2.5.2
            Reporter: lachisis


This problem can occur when LinuxContainerExecutor is used to execute 
containers.
In the common case, when a container runs, a corresponding hierarchy is 
created in the cgroup directory. When the container terminates, the hierarchy 
is deleted within a few seconds (this time can be configured by 
yarn.nodemanager.linux-container-executor.cgroups.delete-delay-ms).

In the code, I found that CgroupsLCEResource sends a signal to kill the 
container process asynchronously and, at the same time, tries to delete the 
container hierarchy within the configured "delete-delay-ms" period. 
But if the container process takes longer than "delete-delay-ms" to die, 
the deletion attempts give up and the container hierarchy remains forever.
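
The race described above can be illustrated with a small simulation. This is 
only a sketch under stated assumptions, not the NodeManager code: the class 
name, method name, and the poll interval are hypothetical, and time is 
simulated rather than measured. It relies on the fact that removing a cgroup 
directory fails while the cgroup still contains tasks.

```java
public class CgroupDeleteRace {

    /**
     * Simulates retrying the removal of a container's cgroup directory.
     * All parameters are illustrative and are not taken from the YARN source.
     *
     * @param processExitMs   simulated time at which the killed process really exits
     * @param deleteTimeoutMs how long the deleter keeps retrying before giving up
     * @param pollMs          simulated pause between two removal attempts
     * @return true if the directory was removed, false if it was left behind
     */
    static boolean cgroupRemoved(long processExitMs, long deleteTimeoutMs, long pollMs) {
        for (long now = 0; now <= deleteTimeoutMs; now += pollMs) {
            // The cgroup can only be removed once it holds no tasks.
            boolean cgroupEmpty = now >= processExitMs;
            if (cgroupEmpty) {
                return true; // removal succeeds
            }
        }
        // The deadline passed while the process was still alive. Nothing
        // retries later, so the hierarchy stays on disk.
        return false;
    }

    public static void main(String[] args) {
        // Process exits quickly: the hierarchy is cleaned up.
        System.out.println("fast exit removed: " + cgroupRemoved(100, 1000, 20));
        // Process needs longer than the timeout to exit: the hierarchy leaks.
        System.out.println("slow exit removed: " + cgroupRemoved(5000, 1000, 20));
    }
}
```

In this model, any container whose process outlives the retry window leaves 
its hierarchy in place, which matches the behaviour reported here.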

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
