Chun-Hung Hsiao created MESOS-9580:
--------------------------------------
Summary: Master sends inconsistent `UpdateFrameworkMessage` to
agents.
Key: MESOS-9580
URL: https://issues.apache.org/jira/browse/MESOS-9580
Project: Mesos
Issue Type: Bug
Components: master
Reporter: Chun-Hung Hsiao
If a framework reregisters with a new user, the master would ignore the user
update because of MESOS-703:
[https://github.com/apache/mesos/blob/f1dc50568dcc90cec7158205dca86a2398a42dcd/src/master/framework.cpp#L526-L529]
However, it would send the *original* framework info provided by the framework
to all agents:
[https://github.com/apache/mesos/blob/f1dc50568dcc90cec7158205dca86a2398a42dcd/src/master/master.cpp#L2748-L2757
https://github.com/apache/mesos/blob/f1dc50568dcc90cec7158205dca86a2398a42dcd/src/master/master.cpp#L3156-L3162|https://github.com/apache/mesos/blob/f1dc50568dcc90cec7158205dca86a2398a42dcd/src/master/master.cpp#L2748-L2757]
But, when an agent reregistered, the master would send the framework info from
its in-memory state:
[https://github.com/apache/mesos/blob/f1dc50568dcc90cec7158205dca86a2398a42dcd/src/master/master.cpp#L7827-L7842]
This would make the framework info inconsistent between the master and some of
its agents. Although it won't affect executor and task launch (as the framework
info would be injected into {{RunTask(Group)*Message}}), if there is a master
failover, a race between framework and agent reregistrations would make the new
master learn different framework info.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)