Liu created FLINK-25566:
---------------------------
Summary: Fail to cancel task if disk is bad for
java.lang.NoClassDefFoundError
Key: FLINK-25566
URL: https://issues.apache.org/jira/browse/FLINK-25566
Project: Flink
Issue Type: Improvement
Components: Runtime / Task
Reporter: Liu
Attachments: image-2022-01-07-19-07-10-968.png,
image-2022-01-07-19-08-49-038.png, image-2022-01-07-19-11-39-448.png
When we detecting disk error, we will restart the job to rescale. However, the
related task will stuck in cancelling for java.lang.NoClassDefFoundError.
!image-2022-01-07-19-08-49-038.png|width=743,height=157!
In the TaskManagerRunner's method onFatalError, it will not terminateJVM at
once. The process will stuck in the disk.
!image-2022-01-07-19-11-39-448.png|width=1085,height=400!
In this case, maybe we should terminate the container at once.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)