Chesnay Schepler created FLINK-25611:
----------------------------------------

             Summary: Remove CoordinatorExecutorThreadFactory thread creation guards
                 Key: FLINK-25611
                 URL: https://issues.apache.org/jira/browse/FLINK-25611
             Project: Flink
          Issue Type: Improvement
          Components: API / Core
            Reporter: Chesnay Schepler
            Assignee: Chesnay Schepler
             Fix For: 1.15.0, 1.13.6, 1.14.3


The CoordinatorExecutorThreadFactory of the SourceCoordinator checks that only 
a single thread is active and that no new thread can be created if the previous 
one failed.

Neither of these guards works properly. If a runnable in the ThreadPoolExecutor 
fails, the executor uses the worker thread of the failed runnable to spawn a 
new worker. This means that at the time the second thread is created, the 
previous thread is still alive, and the exception that caused the failure 
hasn't even been propagated to the thread's exception handler.
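
The behavior described above can be reproduced with a plain JDK 
ThreadPoolExecutor. The following is a minimal sketch, not Flink's code: 
WorkerReplacementDemo, RecordingFactory and observe() are hypothetical names 
made up for this illustration. The factory records what it sees when it is 
asked for the replacement thread after a task submitted via execute() throws.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadFactory;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicBoolean;

public class WorkerReplacementDemo {

    /** Hypothetical recording factory for illustration; this is not Flink's class. */
    static final class RecordingFactory implements ThreadFactory {
        final AtomicBoolean handlerCalled = new AtomicBoolean(false);
        final List<String> observations = new CopyOnWriteArrayList<>();
        final CountDownLatch secondThreadRequested = new CountDownLatch(1);
        private volatile Thread previous;

        @Override
        public Thread newThread(Runnable r) {
            Thread prev = previous;
            if (prev != null) {
                // Second call: record which thread asks for the replacement,
                // whether the failed worker is still alive, and whether its
                // uncaught exception handler has already run.
                observations.add("createdByPreviousWorker=" + (Thread.currentThread() == prev));
                observations.add("previousAlive=" + prev.isAlive());
                observations.add("handlerAlreadyCalled=" + handlerCalled.get());
                secondThreadRequested.countDown();
            }
            Thread t = new Thread(r, "demo-worker");
            t.setUncaughtExceptionHandler((thread, error) -> handlerCalled.set(true));
            previous = t;
            return t;
        }
    }

    static List<String> observe() throws InterruptedException {
        RecordingFactory factory = new RecordingFactory();
        ThreadPoolExecutor executor = new ThreadPoolExecutor(
                1, 1, 0L, TimeUnit.MILLISECONDS, new LinkedBlockingQueue<>(), factory);
        try {
            // execute() (not submit()) lets the exception escape the worker loop.
            executor.execute(() -> {
                throw new RuntimeException("simulated failure");
            });
            // Wait until the executor has asked the factory for a replacement.
            factory.secondThreadRequested.await(5, TimeUnit.SECONDS);
        } finally {
            executor.shutdownNow();
        }
        return new ArrayList<>(factory.observations);
    }

    public static void main(String[] args) throws InterruptedException {
        observe().forEach(System.out::println);
    }
}
```

In this sketch the replacement request arrives on the failed worker thread 
itself, while that thread is still alive and before its handler has run, so a 
guard placed in newThread() cannot reliably tell a healthy executor from one 
whose previous thread has failed.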

As these guards do not work and, to boot, hide the actual failure causes 
(as in FLINK-24855), we should remove them.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
