[jira] [Commented] (MESOS-10194) Mesos master failure "Check failed: 'get_(role)' Must be SOME"

Andrei Sekretenko (Jira) Tue, 03 Nov 2020 06:25:49 -0800


    [ 
https://issues.apache.org/jira/browse/MESOS-10194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17225453#comment-17225453
 ]


Andrei Sekretenko commented on MESOS-10194:
-------------------------------------------

[~Jerome Soussens] Looks related. This one is also caused by an attempt to 
untrack used resources, this time of a task.

I would guess that some resources used by some executor under this role were 
untracked for the second time before that; as a result, there "became not 
enough used resources left to untrack".

> Mesos master failure "Check failed: 'get_(role)' Must be SOME"
> --------------------------------------------------------------
>
>                 Key: MESOS-10194
>                 URL: https://issues.apache.org/jira/browse/MESOS-10194
>             Project: Mesos
>          Issue Type: Bug
>    Affects Versions: 1.10.0, 1.11.0
>            Reporter: Jerome Soussens
>            Assignee: Andrei Sekretenko
>            Priority: Critical
>         Attachments: log_mesos_crash_role_13102020.txt, 
> mesos_scalars_at_slaveId_crash.log
>
>
>  
> *Impact* : mesos-master crash with log :
> {code:java}
> hierarchical.cpp:460] Check failed: 'get_(role)' Must be SOME
> {code}
> *Possible scenario :*
> A framework, using a specific role, is stopped. More or less at the same 
> time, some remaining task status for this framework comes to the master from 
> the executor. But the roles is no more listed.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (MESOS-10194) Mesos master failure "Check failed: 'get_(role)' Must be SOME"

Reply via email to