[
https://issues.apache.org/jira/browse/MESOS-8469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16338104#comment-16338104
]
Greg Mann commented on MESOS-8469:
----------------------------------
Related test reviews:
https://reviews.apache.org/r/65315/
https://reviews.apache.org/r/65316/
> Mesos master might drop some events in the operator API stream
> --------------------------------------------------------------
>
> Key: MESOS-8469
> URL: https://issues.apache.org/jira/browse/MESOS-8469
> Project: Mesos
> Issue Type: Bug
> Reporter: Vinod Kone
> Assignee: Greg Mann
> Priority: Critical
> Fix For: 1.5.0
>
>
> Inside `Master::updateTask`, we call `Subscribers::send` which asynchronously
> calls `Subscribers::Subscriber::send` on each subscriber.
> But the problem is that inside `Subscribers:Subscriber::send` we are looking
> up the state of the master (e.g., getting Task* and Framework*) which might
> have changed between `Subscribers::send ` and `Subscribers::Subscriber::send`.
>
> For example, if a terminal task received an acknowledgement the task might be
> removed from master's state, causing us to drop the TASK_UPDATED event.
>
> We noticed this in an internal cluster, where a TASK_KILLED update was sent
> to one subscriber but not the other.
>
>
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)