[ https://issues.apache.org/jira/browse/MESOS-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Greg Mann reassigned MESOS-9847:
--------------------------------

    Assignee: Andrei Budnik

> Docker executor doesn't wait for status updates to be ack'd before shutting
> down.
> ---------------------------------------------------------------------------------
>
>                 Key: MESOS-9847
>                 URL: https://issues.apache.org/jira/browse/MESOS-9847
>             Project: Mesos
>          Issue Type: Bug
>          Components: executor
>            Reporter: Meng Zhu
>            Assignee: Andrei Budnik
>            Priority: Major
>              Labels: containerization
>
> The docker executor doesn't wait for pending status updates to be
> acknowledged before shutting down; instead, it sleeps for one second and then
> terminates:
> {noformat}
> void _stop()
> {
>   // A hack for now ... but we need to wait until the status update
>   // is sent to the slave before we shut ourselves down.
>   // TODO(tnachen): Remove this hack and also the same hack in the
>   // command executor when we have the new HTTP APIs to wait until
>   // an ack.
>   os::sleep(Seconds(1));
>   driver.get()->stop();
> }
> {noformat}
> This results in a race between the task status update (e.g. TASK_FINISHED)
> and executor exit. The latter causes the agent to generate a `TASK_FAILED`
> status update by itself, leading to the confusing case where the agent
> handles two different terminal status updates.
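
For illustration only, here is a minimal sketch of the behaviour the TODO asks for: block until every sent status update has been acknowledged, with a timeout as a fallback, instead of sleeping for a fixed second. It uses only the C++ standard library. The class `PendingUpdates`, its method names, and the string update IDs are hypothetical and are not Mesos APIs; how acknowledgements actually reach the executor is left out.

```cpp
#include <chrono>
#include <condition_variable>
#include <mutex>
#include <set>
#include <string>

// Hypothetical helper (not part of Mesos): tracks status updates that
// have been sent but not yet acknowledged.
class PendingUpdates
{
public:
  // Record an update, identified by a caller-chosen ID, as sent.
  void sent(const std::string& id)
  {
    std::lock_guard<std::mutex> lock(mutex_);
    pending_.insert(id);
  }

  // Record an acknowledgement and wake any thread waiting to shut down.
  void acknowledged(const std::string& id)
  {
    {
      std::lock_guard<std::mutex> lock(mutex_);
      pending_.erase(id);
    }
    cv_.notify_all();
  }

  // Block until every update is acknowledged or `timeout` elapses.
  // Returns true if nothing is pending on return, false on timeout.
  bool waitForAcks(std::chrono::milliseconds timeout)
  {
    std::unique_lock<std::mutex> lock(mutex_);
    return cv_.wait_for(lock, timeout, [this] { return pending_.empty(); });
  }

private:
  std::mutex mutex_;
  std::condition_variable cv_;
  std::set<std::string> pending_;
};
```

Under these assumptions, a shutdown path would call `waitForAcks(...)` before stopping the driver. The timeout only bounds the wait if the agent never responds; it does not decide the race the way a fixed sleep does.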