Timothy Chen created MESOS-2601:
-----------------------------------

             Summary: Tasks are not removed after recovery from slave and mesos containerizer
                 Key: MESOS-2601
                 URL: https://issues.apache.org/jira/browse/MESOS-2601
             Project: Mesos
          Issue Type: Bug
          Components: containerization, slave
    Affects Versions: 0.22.1
            Reporter: Timothy Chen

In our test cluster, we've seen that tasks launched with the Mesos containerizer are recovered after a slave restart, but the actual command process is no longer running and the checkpointed executor is not marked as completed.

The Mesos containerizer recovers, and none of the isolators could recover the task, but the container itself is somehow never removed, and the monitor keeps calling usage on the containerizer.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)