Liu created FLINK-23300:
---------------------------

             Summary: Job fails very slow because of no notifyAllocationFailure 
for DeclarativeSlotManager
                 Key: FLINK-23300
                 URL: https://issues.apache.org/jira/browse/FLINK-23300
             Project: Flink
          Issue Type: Improvement
          Components: Runtime / Task
    Affects Versions: 1.13.1
            Reporter: Liu


When container is killed, flink on yarn can detect the problem very quickly. 
But when using default DeclarativeSlotManager, notifyAllocationFailure is not 
called and the task is not failed until heartbeat is timeout. So the failover 
will be very slow. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to