Qian Zhang created MESOS-8876:
---------------------------------

             Summary: Normal exit of Docker container using rexray volume 
results in TASK_FAILED
                 Key: MESOS-8876
                 URL: https://issues.apache.org/jira/browse/MESOS-8876
             Project: Mesos
          Issue Type: Bug
          Components: docker
            Reporter: Qian Zhang
            Assignee: Qian Zhang
         Attachments: task_exit_volume.json

In the fix to  MESOS-8488, we reap the Docker container process directly in 
Docker executor, and it will wait for `docker run` to return for at most 3 
seconds. However, in some cases, the `docker run` command will indeed need more 
than 3 seconds to return, e.g., the Docker container uses an external rexray 
volume (see the attached task json as an example), for such container, there 
will be about 5 seconds between container process exits and the `docker run` 
returns (I suspect Docker daemon was doing some stuff related to rexray volume 
during this time), so we will reap this container, and send a {{TASK_FAILED}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to