Stephan Ewen created FLINK-23843:
------------------------------------

             Summary: Exceptions during 
"SplitEnumeratorContext.runInCoordinatorThread()" should cause Global Failure 
instead of Process Kill
                 Key: FLINK-23843
                 URL: https://issues.apache.org/jira/browse/FLINK-23843
             Project: Flink
          Issue Type: Improvement
          Components: Runtime / Coordination
    Affects Versions: 1.13.2
            Reporter: Stephan Ewen
            Assignee: Stephan Ewen
             Fix For: 1.14.0


Currently, when a the method "SplitEnumeratorContext.runInCoordinatorThread()" 
throws an exception, the effect is a process kill of the JobManager process.

The chain how the process kill happens is:
* An exception bubbling up in the executor, killing the executor thread
* The executor starts a replacement thread, which is forbidden by the thread 
factory (as a safety net) and causes a process kill.

We should prevent such exceptions from bubbling up in the coordinator executor.




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to