[ https://issues.apache.org/jira/browse/MESOS-540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13702503#comment-13702503 ]
Benjamin Hindman commented on MESOS-540: ---------------------------------------- I like it! > Executor health checking. > ------------------------- > > Key: MESOS-540 > URL: https://issues.apache.org/jira/browse/MESOS-540 > Project: Mesos > Issue Type: Improvement > Reporter: Benjamin Mahler > > We currently do not health check running executors. > At Twitter, this has led to out-of-band health checking of executors for an > internal framework. > For the Storm framework, this has led to out-of-band health checking via > ZooKeeper. Health checking would allow Storm to use finer grained executors > for better isolation. > This also helps the Hadoop and Jenkins frameworks as well should health > checking be desired. > As for implementation, I would propose adding a call on the Executor > interface: > /** > * Invoked by the ExecutorDriver to determine the health of the executor. > * When this function returns, the Executor is considered healthy. > */ > void heartbeat(ExecutorDriver* driver) = 0; > The driver can then heartbeat periodically and kill when the Executor is not > responding to heartbeats. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira