Timothy Chen created MESOS-2115:
-----------------------------------

             Summary: Improve recovering Docker containers when slave is 
contained
                 Key: MESOS-2115
                 URL: https://issues.apache.org/jira/browse/MESOS-2115
             Project: Mesos
          Issue Type: Improvement
            Reporter: Timothy Chen


Currently when docker containerizer is recovering it checks the checkpointed 
executor pids to recover which containers are still running, and remove the 
rest of the containers from docker ps that isn't recognized.

This is problematic when the slave itself was in a docker container, as when 
the slave container dies all the forked processes are removed as well, so the 
checkpointed executor pids are no longer valid.

We have to assume the docker containers might be still running even though the 
checkpointed executor pids are not.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to