Gyula Fora created FLINK-35414:
----------------------------------
Summary: Cancel jobs through rest api for last-state upgrades
Key: FLINK-35414
URL: https://issues.apache.org/jira/browse/FLINK-35414
Project: Flink
Issue Type: Improvement
Components: Kubernetes Operator
Reporter: Gyula Fora
Assignee: Gyula Fora
The kubernetes operator currently always deletes the JM deployment directly
during last-state upgrades instead of attempting any type of graceful shutdown.
We could improve the last-state upgrade logic to cancel the job in cases where
the JM is healthy and then simply extract the last checkpoint info through the
rest api like we already do for terminal job states.
This would allow the last-state upgrade mode to work even for session jobs and
this may even eliminate a few corner cases that can result from the current
forceful upgrade mechanism.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)