Joris Van Remoortere created MESOS-3420:
-------------------------------------------
Summary: Resolve shutdown semantics for Machine/Down
Key: MESOS-3420
URL: https://issues.apache.org/jira/browse/MESOS-3420
Project: Mesos
Issue Type: Task
Reporter: Joris Van Remoortere
When an operator uses the {{machine/down}} endpoint, the master sends a
shutdown message to the agent.
We need to discuss and resolve the semantics that we want regarding the
operators and frameworks knowing when their tasks are terminated.
One option is to explicitly remove the agent from the master which will send
the {{TASK_LOST}} updates and {{SlaveLostMessage}} directly from the master.
The concern around this is that during a network partition, or if the agent was
down at the time, that these tasks could still be running.
This is a general problem related to task life-times being dissociated with
that life-time of the agent.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)