Chun-Hung Hsiao created MESOS-9580:
--------------------------------------

             Summary: Master sends inconsistent `UpdateFrameworkMessage` to 
agents.
                 Key: MESOS-9580
                 URL: https://issues.apache.org/jira/browse/MESOS-9580
             Project: Mesos
          Issue Type: Bug
          Components: master
            Reporter: Chun-Hung Hsiao


If a framework reregisters with a new user, the master ignores the user 
update because of MESOS-703:
[https://github.com/apache/mesos/blob/f1dc50568dcc90cec7158205dca86a2398a42dcd/src/master/framework.cpp#L526-L529]

However, it still sends the framework info exactly as provided by the 
framework (including the new user) to all agents:
[https://github.com/apache/mesos/blob/f1dc50568dcc90cec7158205dca86a2398a42dcd/src/master/master.cpp#L2748-L2757]
[https://github.com/apache/mesos/blob/f1dc50568dcc90cec7158205dca86a2398a42dcd/src/master/master.cpp#L3156-L3162]

But when an agent reregisters, the master sends the framework info from 
its in-memory state:
[https://github.com/apache/mesos/blob/f1dc50568dcc90cec7158205dca86a2398a42dcd/src/master/master.cpp#L7827-L7842]

This makes the framework info inconsistent between the master and some of 
its agents. Although this does not affect executor and task launches (the 
framework info is injected into {{RunTask(Group)*Message}}), after a master 
failover a race between framework and agent reregistrations can make the new 
master learn either version of the framework info, depending on which 
reregistration arrives first.
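
To make the sequence concrete, here is a minimal, self-contained sketch of the 
behaviour described above. It is a toy model, not Mesos code: the 
{{FrameworkInfo}}, {{Agent}} and {{Master}} structs, the two helper functions, 
and the users "alice" and "bob" are all hypothetical, and the way the new master 
recovers framework info from a reregistering agent is an assumption made for 
illustration.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>

// Hypothetical, heavily simplified stand-ins; these are not the Mesos types.
struct FrameworkInfo {
  std::string id;
  std::string user;
};

struct Agent {
  // Framework info as last received from a master, keyed by framework ID.
  std::map<std::string, FrameworkInfo> frameworks;
};

struct Master {
  std::map<std::string, FrameworkInfo> frameworks;  // in-memory state
  std::set<Agent*> agents;

  // Framework (re)registration. For a known framework the user change is
  // dropped from the master's state (the behaviour attributed to MESOS-703),
  // yet the info as provided by the framework is broadcast to every agent.
  void reregisterFramework(const FrameworkInfo& provided) {
    frameworks.insert({provided.id, provided});  // no-op if already known
    for (Agent* agent : agents) {
      agent->frameworks[provided.id] = provided;
    }
  }

  // Agent reregistration. Assumption: the master learns frameworks it does
  // not know yet from the agent, then sends back its own in-memory copy.
  void reregisterAgent(Agent* agent) {
    for (const auto& entry : agent->frameworks) {
      frameworks.insert(entry);  // no-op if already known
    }
    agents.insert(agent);
    for (const auto& entry : frameworks) {
      agent->frameworks[entry.first] = entry.second;
    }
  }
};

// Replays the scenario from the report and summarizes who believes what.
std::string describeInconsistency() {
  Master master;
  Agent a1, a2;
  master.reregisterFramework({"fw", "alice"});  // initial registration
  master.reregisterAgent(&a1);
  master.reregisterAgent(&a2);

  master.reregisterFramework({"fw", "bob"});  // new user: ignored, but broadcast
  assert(master.frameworks.at("fw").user == "alice");
  master.reregisterAgent(&a2);  // a2 is overwritten with the master's copy

  return "master=" + master.frameworks.at("fw").user +
         " a1=" + a1.frameworks.at("fw").user +
         " a2=" + a2.frameworks.at("fw").user;
}

// After a failover the new master starts empty, so the user it ends up with
// depends on whether a stale agent or the framework reregisters first.
std::string userAfterFailover(bool agentFirst) {
  Agent stale;
  stale.frameworks["fw"] = {"fw", "alice"};  // an agent in the state of a2
  Master fresh;
  if (agentFirst) {
    fresh.reregisterAgent(&stale);
    fresh.reregisterFramework({"fw", "bob"});
  } else {
    fresh.reregisterFramework({"fw", "bob"});
    fresh.reregisterAgent(&stale);
  }
  return fresh.frameworks.at("fw").user;
}
```

In this model {{describeInconsistency()}} leaves the master and a2 with the old 
user while a1 holds the new one, and {{userAfterFailover()}} returns a different 
user depending on the order of reregistration, which mirrors the race described 
above.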



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)