[
https://issues.apache.org/jira/browse/MESOS-5067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15219323#comment-15219323
]
Guillermo Rodriguez commented on MESOS-5067:
--------------------------------------------
Ok so I implemented the failover. Now the problem I have is that the launcher
is marathon and marathon adds a random string at the end of the name. IN other
words I will never get the same ID again.
Still I have set the failover to 3 days, so tasks will continue running.
Partial fix.
> Killing a framework does not kill framework tasks
> -------------------------------------------------
>
> Key: MESOS-5067
> URL: https://issues.apache.org/jira/browse/MESOS-5067
> Project: Mesos
> Issue Type: Wish
> Reporter: Guillermo Rodriguez
>
> By default, when a framework is terminated, mesos-master terminates all child
> tasks for that framework.
> There are some cases when I might like to stop a framework but not kill the
> tasks of the framework.
> In my particular case. I have Docker Swarm running, Swarm allows me to send
> number crunching jobs to the cluster and they can run for hours.
> The problem is that Swarm is also quite flaky and can crash anytime. If that
> happens then all jobs are terminated and all the processing time is lost.
> So, I would like to be able to set some flag for a framework where I tell
> mesos master that the jobs started by the framework should be considered
> separate of the framework itself so that the framework can be restarted and
> jobs will keep running.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)