Zhitao Li created MESOS-8725:
--------------------------------

             Summary: Support deadline for tasks
                 Key: MESOS-8725
                 URL: https://issues.apache.org/jira/browse/MESOS-8725
             Project: Mesos
          Issue Type: Improvement
            Reporter: Zhitao Li


In our environment, we run a lot of batch jobs, some of which have tight 
timeline. If any tasks in the job runs longer than x hours, it does not make 
sense to run it anymore. 
 
For instance, a team would submit a job which builds a weekly index and repeats 
every Monday. If the job does not finish before next Monday for whatever 
reason, there is no point to keep any task running.
 
We believe that implementing deadline tracking distributed across our cluster 
makes more sense as it makes the system more scalable and also makes our 
centralized state machine simpler.
 
One idea I have right now is to add an  *optional* *TimeInfo deadline* to 
TaskInfo field, and all default executors in Mesos can simply terminate the 
task and send a proper *StatusUpdate.*



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to