[ 
https://issues.apache.org/jira/browse/MESOS-741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14032769#comment-14032769
 ] 

Benjamin Hindman commented on MESOS-741:
----------------------------------------

Took a brief look at https://reviews.apache.org/r/22579 and saw the outcome of 
"I propose that we pass the PID of the executor to the health-checker program 
and send protobufs back with the health updates". Any reason not to just have 
the health-checker output the protobufs on its stdout? It could even do this 
as JSON for human consumption and/or as recordio, as we're already doing in 
src/usage/main.cpp for the mesos-usage utility. The value here is that the tool 
becomes a standalone program that someone can compose with, even if they don't 
have the executor-like model.
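
To make the stdout idea concrete, here is a minimal sketch of such a standalone 
checker in Python. It is not the code under review: the names check_once and 
emit and the JSON fields "healthy" and "timestamp" are hypothetical, and it 
assumes an exit status of 0 means healthy. It writes one JSON object per line 
rather than protobuf or recordio.

```python
import json
import subprocess
import sys
import time


def check_once(command, timeout=5.0):
    """Run the health command once; exit status 0 is treated as healthy.

    The result fields ("healthy", "timestamp") are illustrative only.
    """
    try:
        result = subprocess.run(
            command,
            shell=True,
            timeout=timeout,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
        healthy = result.returncode == 0
    except subprocess.TimeoutExpired:
        # A check that does not finish in time counts as unhealthy.
        healthy = False
    return {"healthy": healthy, "timestamp": time.time()}


def emit(update, out=sys.stdout):
    """Write one health update as a single JSON line and flush it."""
    out.write(json.dumps(update) + "\n")
    out.flush()
```

Calling emit(check_once("exit 0")) prints one line that any consumer can read 
from a pipe, so the checker needs no knowledge of an executor PID.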

> Add health checking for tasks
> -----------------------------
>
>                 Key: MESOS-741
>                 URL: https://issues.apache.org/jira/browse/MESOS-741
>             Project: Mesos
>          Issue Type: Story
>          Components: master, slave
>            Reporter: Niklas Quarfot Nielsen
>            Assignee: Timothy Chen
>
> Determining the health of a task during its lifetime (during start up, while 
> it is running, shutting down etc.) can be considered a more elaborate matter 
> than only observing its process state.
> The task health might be determined by any combination of observable 
> behavior; for example the process listening on a certain range of 
> ports, writing certain files or pipes, responding to messages, utilizing 
> resources up to or below certain thresholds etc.
> It could be a powerful extension to add an optional HealthCommand message to 
> the interface for launching and running tasks. This message could encode:
> 1) A command to be run at the slave to determine the health of the task. The 
> return value of the command will tell if the task is healthy or unhealthy. 
> 2) An interval at which to run the health command.
> In connection with this, it could make sense to introduce new healthy and 
> unhealthy task states.
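
The HealthCommand proposal in the quoted description can be sketched roughly 
as follows in Python. This is an illustration only, not the Mesos message 
definition: the class layout, the field names command and interval_seconds, 
the monitor helper, and the "healthy"/"unhealthy" strings are all assumptions, 
as is treating a return value of 0 as healthy.

```python
import subprocess
import time
from dataclasses import dataclass


@dataclass
class HealthCommand:
    """Hypothetical stand-in for the proposed message."""
    command: str             # shell command run to determine task health
    interval_seconds: float  # delay between consecutive checks


def is_healthy(health):
    """Run the command once; a return value of 0 is treated as healthy."""
    result = subprocess.run(
        health.command,
        shell=True,
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def monitor(health, checks, sleep=time.sleep):
    """Run the command `checks` times, waiting the interval between runs.

    Returns the observed states in order. `sleep` is injectable so the
    loop can be exercised without real delays.
    """
    states = []
    for i in range(checks):
        states.append("healthy" if is_healthy(health) else "unhealthy")
        if i + 1 < checks:
            sleep(health.interval_seconds)
    return states
```

For example, monitor(HealthCommand("exit 0", 10.0), 3) runs the command three 
times, ten seconds apart, and returns the three observed states.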



--
This message was sent by Atlassian JIRA
(v6.2#6252)
