Benjamin Mahler created MESOS-1409:
--------------------------------------

             Summary: Send status update acknowledgments through the Master.
                 Key: MESOS-1409
                 URL: https://issues.apache.org/jira/browse/MESOS-1409
             Project: Mesos
          Issue Type: Task
    Affects Versions: 0.19.0
            Reporter: Benjamin Mahler


Currently status update acknowledgments are sent directly from the scheduler 
driver to the slave.

This has worked well in the sense that it removes the additional overhead of 
processing acknowledgments through the master.

However, in order to provide correct reconciliation of tasks, the master needs 
to hold on to terminal *unacknowledged* tasks to ensure that it can correctly 
identify these until the framework receives the terminal update. Without this, 
we are susceptible to MESOS-1389.

There will be additional traffic through the master from acknowledgments that 
we need to be aware of. However, most of the scaling bottlenecks we have seen 
in the past are in the scheduler driver's queue when we send messages faster 
than the framework can process them in a persistent manner. With MESOS-1408, I 
suspect that we will have a smaller number of update retries in the presence of 
a heavy load on a scheduler, even with the acknowledgments going through the 
master.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to