[
https://issues.apache.org/jira/browse/FLINK-35522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Zhu Zhu closed FLINK-35522.
---------------------------
Fix Version/s: 1.18.2
1.19.2
Resolution: Fixed
1.18: 8ae7986e1dca80a686b678feb4ea3bdfff4a19bb
1.19: 7afa6eaae3218f273c7971434ea88a2cfd966ced
> The source task may get stuck after a failover occurs in batch jobs
> -------------------------------------------------------------------
>
> Key: FLINK-35522
> URL: https://issues.apache.org/jira/browse/FLINK-35522
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination
> Affects Versions: 1.17.2, 1.19.0, 1.18.1, 1.20.0
> Reporter: xingbe
> Assignee: xingbe
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.18.2, 1.20.0, 1.19.2
>
>
> If the source task does not get assigned a split because the SplitEnumerator
> has no more splits, and a failover occurs during the closing process, the
> SourceCoordinatorContext will not resend the NoMoreSplit event to the newly
> started source task, causing the source vertex to remain stuck indefinitely.
> This case may only occur in batch jobs where speculative execution has been
> enabled.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)