[ 
https://issues.apache.org/jira/browse/MESOS-1364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13997116#comment-13997116
 ] 

Till Toenshoff commented on MESOS-1364:
---------------------------------------

There are at least two possible ways to handle the specific {{destroy}} 
invocation that is needed for the above cases and I would like to draft those 
to get your input.

As of now, the {{destroy}} call within the ExternalContainerizerProcess expects 
a {{ContainerID}} to be among those that are covered by its internal states ( 
{{actives}} ). This is needed for things like providing a proper sandbox 
environment, as in redirecting {{stderr}} output to the correct output as 
identified in the {{launch}} call of that specific {{ContainerID}}. Clearly 
such sandbox can not be provided for orphaned containers. 

The alternatives I can see right now are:
(1) Never validate if a container is among the actives within the {{destroy}} 
call and always handle such call as a forceful destruction.
(2) Add a flag to the {{destroy}} parameters, allowing it to be enforced for 
the above case but not enforced (still using a sandbox) for all other cases.

Even though I have already implemented a patch according to (2), I now got to a 
stage where I think that we should always handle a destroy forcefully (hence 
according to (1) ).

> ExternalContainerizer 'recover' should destroy orphaned ContainerIDs
> --------------------------------------------------------------------
>
>                 Key: MESOS-1364
>                 URL: https://issues.apache.org/jira/browse/MESOS-1364
>             Project: Mesos
>          Issue Type: Bug
>    Affects Versions: 0.19.0
>            Reporter: Till Toenshoff
>            Assignee: Till Toenshoff
>              Labels: ExternalContainerizer
>
> Currently the EC does ignore any {{ContaineriID}} s that got returned by the 
> ECP {{containers}} invocation but are not recoverable (e.g. missing slave 
> state). Those should get killed by the EC invoking {{destroy}} for those 
> {{ContainerID}} s on the ECP just like the LinuxLauncher does this in 
> connection with the MesosContainerizer. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to