[ 
https://issues.apache.org/jira/browse/MESOS-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Greg Mann reassigned MESOS-9847:
--------------------------------

    Assignee: Andrei Budnik

> Docker executor doesn't wait for status updates to be ack'd before shutting 
> down.
> ---------------------------------------------------------------------------------
>
>                 Key: MESOS-9847
>                 URL: https://issues.apache.org/jira/browse/MESOS-9847
>             Project: Mesos
>          Issue Type: Bug
>          Components: executor
>            Reporter: Meng Zhu
>            Assignee: Andrei Budnik
>            Priority: Major
>              Labels: containerization
>
> The docker executor doesn't wait for pending status updates to be 
> acknowledged before shutting down, instead it sleeps for one second and then 
> terminates:
> {noformat}
>   void _stop()
>   {
>     // A hack for now ... but we need to wait until the status update
>     // is sent to the slave before we shut ourselves down.
>     // TODO(tnachen): Remove this hack and also the same hack in the
>     // command executor when we have the new HTTP APIs to wait until
>     // an ack.
>     os::sleep(Seconds(1));
>     driver.get()->stop();
>   }
> {noformat}
> This would result in racing between task status update (e.g. TASK_FINISHED) 
> and executor exit. The latter would lead agent generating a `TASK_FAILED` 
> status update by itself, leading to the confusing case where the agent 
> handles two different terminal status updates.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to