Vijay created FLINK-33944:
-----------------------------
Summary: Apache Flink: Process to restore more than one job on job
manager startup from the respective savepoints
Key: FLINK-33944
URL: https://issues.apache.org/jira/browse/FLINK-33944
Project: Flink
Issue Type: Bug
Components: Runtime / Checkpointing
Affects Versions: 1.18.0
Reporter: Vijay
We are running Flink 1.18 on our cluster. The job manager is deployed in
"Application mode", and we are looking for a way to restore multiple jobs
(each from its own savepoint directory) when the job manager starts.
Currently, "standalone-job.sh" can restore only one job, using the
--fromSavepoint and --allowNonRestoredState options. However, we need a way
to trigger multiple job executions via a Java client.
Note: We are not using the native Kubernetes deployment; we deploy in
standalone mode on k8s.
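To make the request concrete, here is a minimal sketch of how per-job restore settings could be derived from a job-to-savepoint mapping. The class name SavepointRestoreSettings and its methods are hypothetical; only the two configuration keys (execution.savepoint.path and execution.savepoint.ignore-unclaimed-state, the counterparts of --fromSavepoint and --allowNonRestoredState) are taken from Flink itself.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper: builds restore settings for each job from its savepoint path. */
final class SavepointRestoreSettings {

    // Flink configuration keys that correspond to --fromSavepoint
    // and --allowNonRestoredState.
    static final String SAVEPOINT_PATH = "execution.savepoint.path";
    static final String IGNORE_UNCLAIMED_STATE =
            "execution.savepoint.ignore-unclaimed-state";

    private SavepointRestoreSettings() {}

    /** Returns the configuration entries needed to restore a single job. */
    static Map<String, String> forJob(String savepointPath, boolean allowNonRestoredState) {
        if (savepointPath == null || savepointPath.isBlank()) {
            throw new IllegalArgumentException("savepointPath must not be empty");
        }
        Map<String, String> settings = new LinkedHashMap<>();
        settings.put(SAVEPOINT_PATH, savepointPath);
        settings.put(IGNORE_UNCLAIMED_STATE, Boolean.toString(allowNonRestoredState));
        return settings;
    }

    /** Builds settings for every entry of a job-name -> savepoint-path mapping. */
    static Map<String, Map<String, String>> forJobs(
            Map<String, String> savepointsByJob, boolean allowNonRestoredState) {
        Map<String, Map<String, String>> result = new LinkedHashMap<>();
        savepointsByJob.forEach(
                (job, path) -> result.put(job, forJob(path, allowNonRestoredState)));
        return result;
    }
}
```

Inside the application's main(), each map could presumably be copied into a Flink Configuration and passed to StreamExecutionEnvironment.getExecutionEnvironment(Configuration) before submitting that job with executeAsync(). Whether Application mode in 1.18 honors per-environment savepoint settings for several jobs is an assumption that would need to be verified.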
*Expected process:*
# Before starting the Flink/application image upgrade, trigger savepoints
for all currently running jobs.
# Once the savepoints have completed for all jobs, scale down the job
manager and task manager instances.
# Update the image version on the k8s deployment to the updated application
image.
# After the image version is updated, scale up the job manager and task
manager.
# We need a process to restore the previously running jobs from their
savepoint directories and start all of them.
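Step 1 above could be sketched with the JDK HTTP client against the Flink REST endpoint POST /jobs/:jobid/savepoints. The class name SavepointTrigger, the REST base URL, the job IDs, and the target directory are placeholders for illustration, not values from our setup.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.List;

/** Hypothetical helper: triggers a savepoint for each running job over REST. */
final class SavepointTrigger {

    private SavepointTrigger() {}

    /** Builds the savepoint-trigger URI for one job, tolerating a trailing slash. */
    static URI savepointUri(String restBase, String jobId) {
        String base = restBase.endsWith("/")
                ? restBase.substring(0, restBase.length() - 1)
                : restBase;
        return URI.create(base + "/jobs/" + jobId + "/savepoints");
    }

    /** Builds the JSON request body; the job keeps running if cancelJob is false. */
    static String requestBody(String targetDirectory, boolean cancelJob) {
        String escaped = targetDirectory.replace("\\", "\\\\").replace("\"", "\\\"");
        return "{\"target-directory\":\"" + escaped + "\",\"cancel-job\":" + cancelJob + "}";
    }

    /** Assembles the POST request without sending it. */
    static HttpRequest buildRequest(
            String restBase, String jobId, String targetDirectory, boolean cancelJob) {
        return HttpRequest.newBuilder(savepointUri(restBase, jobId))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(
                        requestBody(targetDirectory, cancelJob)))
                .build();
    }

    /** Sends one trigger request per job and prints the raw responses. */
    static void triggerAll(String restBase, List<String> jobIds, String targetDirectory)
            throws IOException, InterruptedException {
        HttpClient client = HttpClient.newHttpClient();
        for (String jobId : jobIds) {
            HttpResponse<String> response = client.send(
                    buildRequest(restBase, jobId, targetDirectory, false),
                    HttpResponse.BodyHandlers.ofString());
            // The response body carries a request id that can be polled via
            // GET /jobs/:jobid/savepoints/:triggerid until the savepoint completes.
            System.out.println(jobId + " -> HTTP " + response.statusCode()
                    + " " + response.body());
        }
    }
}
```

The sketch only triggers the savepoints. Polling each request id until it completes (step 2) and recording the resulting savepoint locations for the restore in step 5 would still have to be added.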