[
https://issues.apache.org/jira/browse/FLINK-24380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated FLINK-24380:
-----------------------------------
Labels: pull-request-available (was: )
> Flink should handle the state transition of the pod from Pending to Failed
> --------------------------------------------------------------------------
>
> Key: FLINK-24380
> URL: https://issues.apache.org/jira/browse/FLINK-24380
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination
> Affects Versions: 1.14.0, 1.13.2
> Reporter: Yangze Guo
> Assignee: Yangze Guo
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.14.0, 1.13.3, 1.15.0
>
>
> In K8s, there is five phases in pod's lifecycle: Pending, Running, Secceeded,
> Failed and Unknown. Currently, Flink does not handle the state transition of
> the pod from Pending to Failed. If a pod failed from Pending by `OutOfCPU` or
> `OutOfMem`, it will never be released and Flink keep waiting for it.
> To fix this issue, Flink should terminate the pod in Failed phase proactively.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)