[
https://issues.apache.org/jira/browse/MESOS-6482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606701#comment-15606701
]
Yan Xu commented on MESOS-6482:
-------------------------------
The root cause is likely MESOS-4975: when frameworks are torn down, they are
removed from {{frameworks.registered}} and {{frameworks.recovered}} but a bunch
of spurious entries are left in {{slave->tasks}}.
> Master check failure when marking an agent unreachable
> ------------------------------------------------------
>
> Key: MESOS-6482
> URL: https://issues.apache.org/jira/browse/MESOS-6482
> Project: Mesos
> Issue Type: Bug
> Components: master
> Affects Versions: 1.1.0
> Reporter: Yan Xu
> Priority: Blocker
>
> {noformat:title=}
> I1025 16:34:55.423038 44118 master.cpp:6006] Marked agent
> 8e219f7a-06c1-4009-9440-1a33b3e39be5-S473 (x.y.z.com) unreachable: health
> check timed out
> F1025 16:34:55.423632 44118 master.cpp:6036] Check failed:
> frameworks.recovered.contains(frameworkId)
> {noformat}
> Both the master and the agent are on 1.1.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)