Dear Mesos Users,
I just wanted to point out a solved issue 
(https://issues.apache.org/jira/browse/MESOS-2419 
<https://issues.apache.org/jira/browse/MESOS-2419>) where the systemd default 
behaviour prevents tasks from recovering.

The problem is that the default KillMode for systemd processes is cgroup 
(http://www.freedesktop.org/software/systemd/man/systemd.kill.html 
<http://www.freedesktop.org/software/systemd/man/systemd.kill.html>) and hence 
all child processes are killed when the slave stops.
Explicitly setting the KillMode to process allows the executors to survive and 
reconnect. 

Feel free to check our configuration at: 
https://github.com/mesosphere/mesos-deb-packaging/blob/master/systemd/slave.systemd
 
<https://github.com/mesosphere/mesos-deb-packaging/blob/master/systemd/slave.systemd>
Thanks,
Joerg

Reply via email to