It's the executor's responsibility to forcefully kill a task after the task
kill grace period. However, in your case it sounds like the executor is
getting stuck? What is happening in the executor? If the executor is alive
but doesn't implement the grace period force kill logic, the solution is to
Hi,
We have framework that launched Spark jobs on our Mesos cluster. We are
currently having an issue where Spark jobs are getting stuck due to some
timeout issue. We have cancel functionality that would kill send task_kill
message to master. When the jobs get stuck Spark driver task is not