[
https://issues.apache.org/jira/browse/FLINK-36451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17904388#comment-17904388
]
Matthias Pohl edited comment on FLINK-36451 at 12/11/24 3:41 PM:
-----------------------------------------------------------------
* master:
**
[b32181bb95d8865b8d3eac3c6a6ed4f0f0c14a98|https://github.com/apache/flink/commit/b32181bb95d8865b8d3eac3c6a6ed4f0f0c14a98]
** hotfix:
[0df38635655b87df85e7f0d786b7f4ef285debce|https://github.com/apache/flink/commit/0df38635655b87df85e7f0d786b7f4ef285debce]
** hotfix:
[e3b060357eca3566e69180639f6337a9249c8c2a|https://github.com/apache/flink/commit/e3b060357eca3566e69180639f6337a9249c8c2a]
* 1.20:
**
[09a7e7e2ac1c0ed51faaadfde21791e4fc7feb8e|https://github.com/apache/flink/commit/09a7e7e2ac1c0ed51faaadfde21791e4fc7feb8e]
**
[8be40eaec4f260c3863699558192740032066605|https://github.com/apache/flink/commit/8be40eaec4f260c3863699558192740032066605]
**
[494cf93bc009158629ab5e944bbf55c1c0cfbe70|https://github.com/apache/flink/commit/494cf93bc009158629ab5e944bbf55c1c0cfbe70]
* 1.19:
**
[61b1c337c900a646d2cf23757ce22ef563b8f337|https://github.com/apache/flink/commit/61b1c337c900a646d2cf23757ce22ef563b8f337]
**
[5bd04777cc812278f4961dfb602fcfcb2cf52ee9|https://github.com/apache/flink/commit/5bd04777cc812278f4961dfb602fcfcb2cf52ee9]
**
[ca54ab7f0116e83e51ec8d31cfc95271130dda92|https://github.com/apache/flink/commit/ca54ab7f0116e83e51ec8d31cfc95271130dda92]
was (Author: mapohl):
* master:
**
[b32181bb95d8865b8d3eac3c6a6ed4f0f0c14a98|https://github.com/apache/flink/commit/b32181bb95d8865b8d3eac3c6a6ed4f0f0c14a98]
** hotfix:
[0df38635655b87df85e7f0d786b7f4ef285debce|https://github.com/apache/flink/commit/0df38635655b87df85e7f0d786b7f4ef285debce]
** hotfix:
[e3b060357eca3566e69180639f6337a9249c8c2a|https://github.com/apache/flink/commit/e3b060357eca3566e69180639f6337a9249c8c2a]
* 1.20:
**
[09a7e7e2ac1c0ed51faaadfde21791e4fc7feb8e|https://github.com/apache/flink/commit/09a7e7e2ac1c0ed51faaadfde21791e4fc7feb8e]
**
[8be40eaec4f260c3863699558192740032066605|https://github.com/apache/flink/commit/8be40eaec4f260c3863699558192740032066605]
**
[494cf93bc009158629ab5e944bbf55c1c0cfbe70|https://github.com/apache/flink/commit/494cf93bc009158629ab5e944bbf55c1c0cfbe70]
* 1.19: tba
> Kubernetes Application JobManager Potential Deadlock and TaskManager Pod
> Residuals
> ----------------------------------------------------------------------------------
>
> Key: FLINK-36451
> URL: https://issues.apache.org/jira/browse/FLINK-36451
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination
> Affects Versions: 1.19.1
> Environment: * Flink version: 1.19.1
> * - Deployment mode: Flink Kubernetes Application Mode
> * - JVM version: OpenJDK 17
>
> Reporter: xiechenling
> Assignee: Matthias Pohl
> Priority: Major
> Labels: pull-request-available
> Attachments: 1.png, 2.png, jobmanager.log, jstack.txt
>
>
> In Kubernetes Application Mode, when there is significant etcd latency or
> instability, the Flink JobManager may enter a deadlock situation.
> Additionally, TaskManager pods are not cleaned up properly, resulting in
> stale resources that prevent the Flink job from recovering correctly. This
> issue occurs during frequent service restarts or network instability.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)