Gastón Kleiman created MESOS-9674:
-------------------------------------

             Summary: Allocations can occurring between the multiple resource 
changes related to orphan operations.
                 Key: MESOS-9674
                 URL: https://issues.apache.org/jira/browse/MESOS-9674
             Project: Mesos
          Issue Type: Bug
          Components: allocation, master
    Affects Versions: 1.8.0
            Reporter: Gastón Kleiman


{{Master::updateSlave()}} was recently updated to handle orphan operations. The 
new code doesn't pause the allocator before doing multiple dispatches to the 
allocator.

This means that an allocation can be interleaved while the allocator is still 
not fully updated or in an inconsistent state.

I tried just to quickly improve the handler by adding a couple of 
{{allocator->pause()}} and {{allocator->resume()}} calls, but that made quite 
some SLRP tests get stuck and time out waiting for offers.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to