Meng Zhu created MESOS-9777:
-------------------------------

             Summary: Consider doing an internal retry if reservation and etc. 
operations fail due to 409 conflict.
                 Key: MESOS-9777
                 URL: https://issues.apache.org/jira/browse/MESOS-9777
             Project: Mesos
          Issue Type: Improvement
          Components: master
            Reporter: Meng Zhu


A reservation request may return 409 Conflict:

https://github.com/apache/mesos/blob/261d6ef497383795557aaca5dce426b4482eabea/src/master/http.cpp#L4026

It is due to the inherent race between the master and allocator actor. As 
illustrated here:

https://github.com/apache/mesos/blob/261d6ef497383795557aaca5dce426b4482eabea/src/master/allocator/mesos/hierarchical.cpp#L992-L1008

This is not ideal and should be rare. However, it is hard for users to grasp 
this error. It seems to be beneficial for Mesos to retry the reservation 
operation internally for the user.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to