[ https://issues.apache.org/jira/browse/MESOS-6482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606701#comment-15606701 ]
Yan Xu commented on MESOS-6482: ------------------------------- The root cause is likely MESOS-4975: when frameworks are torn down, they are removed from {{frameworks.registered}} and {{frameworks.recovered}} but a bunch of spurious entries are left in {{slave->tasks}}. > Master check failure when marking an agent unreachable > ------------------------------------------------------ > > Key: MESOS-6482 > URL: https://issues.apache.org/jira/browse/MESOS-6482 > Project: Mesos > Issue Type: Bug > Components: master > Affects Versions: 1.1.0 > Reporter: Yan Xu > Priority: Blocker > > {noformat:title=} > I1025 16:34:55.423038 44118 master.cpp:6006] Marked agent > 8e219f7a-06c1-4009-9440-1a33b3e39be5-S473 (x.y.z.com) unreachable: health > check timed out > F1025 16:34:55.423632 44118 master.cpp:6036] Check failed: > frameworks.recovered.contains(frameworkId) > {noformat} > Both the master and the agent are on 1.1. -- This message was sent by Atlassian JIRA (v6.3.4#6332)