[ 
https://issues.apache.org/jira/browse/MESOS-808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dominic Hamon updated MESOS-808:
--------------------------------
    Labels: twitter v1_api  (was: v1_api)

> The scheduler driver should queue messages when disconnected or return 
> delivery status.
> ---------------------------------------------------------------------------------------
>
>                 Key: MESOS-808
>                 URL: https://issues.apache.org/jira/browse/MESOS-808
>             Project: Mesos
>          Issue Type: Improvement
>            Reporter: Benjamin Mahler
>              Labels: twitter, v1_api
>
> Currently when schedulers try to take an action while the driver is 
> disconnected (i.e. a call to SchedulerDriver::disconnected has occurred), the 
> driver will drop the request.
> In the case of launching a task, we'll reply with TASK_LOST directly in the 
> driver. However, with things like killTask, we simply drop the kill task 
> request.
> This behavior seems a little unfriendly for schedulers, as they need to be 
> concerned about queueing any operations until Scheduler::connected is called. 
> We should consider queuing in the driver instead.
> The implementation here can consist of a queue<Message> holding the messages 
> that were constructed while !connected. Once we re-connect, we simply run 
> through this queue sending all messages.
> However, without state in the driver, schedulers will have to live with the 
> possibility of dropped messages anyway (i.e. if they fail while disconnected, 
> any messages will be lost).
> Therefore, an alternative here is possible when we add a v1 API. If we return 
> a Future or other form of status we can indicate whether the message was 
> sent. This is definitely simpler and more reliable than queueing.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to