Zhu Zhu created FLINK-20626:
-------------------------------
Summary: Canceling a job when it is failing will result in job
hanging in CANCELING state
Key: FLINK-20626
URL: https://issues.apache.org/jira/browse/FLINK-20626
Project: Flink
Issue Type: Bug
Components: Runtime / Coordination
Affects Versions: 1.11.2, 1.12.0
Reporter: Zhu Zhu
Assignee: Zhu Zhu
Fix For: 1.13.0, 1.11.4, 1.12.1
If a user manually cancels a job while it is failing (here, failing means the
job has encountered an unrecoverable failure and is about to fail), the job
hangs in the `CANCELING` state and cannot terminate. The cause is that
DefaultScheduler currently always tries to transition from `FAILING` to
`FAILED` to terminate the job. However, canceling the job changes its status to
`CANCELING`, so the transition to `FAILED` does not succeed.
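
The race described above can be illustrated with a minimal, self-contained
sketch. This is not Flink's actual code: the class `JobStatusSketch`, its
`transition` method, and the reduced `JobStatus` enum are hypothetical names
introduced only for illustration, and the sketch assumes that a status change
is a compare-and-set against an expected current status.

```java
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical stand-in for a job's status holder; not a Flink class.
class JobStatusSketch {
    // Reduced set of states, just enough to show the problem.
    enum JobStatus { RUNNING, FAILING, FAILED, CANCELING, CANCELED }

    private final AtomicReference<JobStatus> status =
            new AtomicReference<>(JobStatus.RUNNING);

    // Assumption: a transition only succeeds if the job is still in the
    // expected status when the transition is attempted.
    boolean transition(JobStatus expected, JobStatus target) {
        return status.compareAndSet(expected, target);
    }

    JobStatus current() {
        return status.get();
    }

    public static void main(String[] args) {
        JobStatusSketch job = new JobStatusSketch();

        // The job hits an unrecoverable failure and starts failing.
        job.transition(JobStatus.RUNNING, JobStatus.FAILING);

        // The user cancels while the job is still FAILING.
        job.transition(JobStatus.FAILING, JobStatus.CANCELING);

        // The scheduler unconditionally attempts FAILING -> FAILED, which
        // no longer matches the current status, so nothing happens.
        boolean reachedFailed =
                job.transition(JobStatus.FAILING, JobStatus.FAILED);

        System.out.println(reachedFailed + " " + job.current());
        // prints "false CANCELING"
    }
}
```

In this sketch the job is left in `CANCELING` because the only attempted
terminal transition expects `FAILING`, which mirrors the hang reported here.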