[
https://issues.apache.org/jira/browse/FLINK-23300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17376966#comment-17376966
]
Zhilong Hong commented on FLINK-23300:
--------------------------------------
I think FLINK-23202 and FLINK-23209 are working on solving this issue.
> Job fails very slow because of no notifyAllocationFailure for
> DeclarativeSlotManager
> ------------------------------------------------------------------------------------
>
> Key: FLINK-23300
> URL: https://issues.apache.org/jira/browse/FLINK-23300
> Project: Flink
> Issue Type: Improvement
> Components: Runtime / Task
> Affects Versions: 1.13.1
> Reporter: Liu
> Priority: Major
>
> When container is killed, flink on yarn can detect the problem very quickly.
> But when using default DeclarativeSlotManager, notifyAllocationFailure is not
> called and the task is not failed until heartbeat is timeout. So the failover
> will be very slow.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)