Benjamin Mahler created MESOS-6941:
--------------------------------------

             Summary: Add support for batch processing of status updates, to 
increase latency / throughput / cluster scalability.
                 Key: MESOS-6941
                 URL: https://issues.apache.org/jira/browse/MESOS-6941
             Project: Mesos
          Issue Type: Improvement
          Components: agent, framework api, master
            Reporter: Benjamin Mahler


Currently, each task has its own independent status update stream. Within an 
individual stream, updates are sent to schedulers in a serial fashion: the 
agent will send the N+1th status update only after it receives the scheduler's 
acknowledgement for the Nth status update.

This approach slows down throughput substantially and has the potential to 
backlog status updates when they occur rapidly. Rather, we should add the 
ability for all available updates on a stream to be sent together (there should 
probably be a limit to the size of this "batch") so that the scheduler can 
process them together without incurring the round trip acknowledgement latency 
in between each update.

In addition, there may be cases of updates where the scheduler only wants the 
latest information (e.g. download status per MESOS-2256, or possibly health 
information). But this should be tackled separately.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to