[
https://issues.apache.org/jira/browse/FLINK-30844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17682532#comment-17682532
]
Matthias Pohl commented on FLINK-30844:
---------------------------------------
{{TaskTest.testInterruptibleSharedLockInInvokeAndCancel}} caused the failure
{code}
00:59:02,291 [Cancellation Watchdog for Test Task (1/1)#0
(003bbd51a0b61b0ff2925c31e749f53e_00000000000000000000000000000000_0_0).] ERROR
org.apache.flink.util.FatalExitExceptionHandler [] - FATAL: Thread
'Cancellation Watchdog for Test Task (1/1)#0
(003bbd51a0b61b0ff2925c31e749f53e_00000000000000000000000000000000_0_0).'
produced an uncaught exception. Stopping the process...
org.apache.flink.util.FlinkRuntimeException: Error in Task Cancellation Watch
Dog
at
org.apache.flink.runtime.taskmanager.Task$TaskCancelerWatchDog.run(Task.java:1801)
~[classes/:?]
at java.lang.Thread.run(Thread.java:748) [?:1.8.0_292]
Caused by: java.lang.RuntimeException: Unexpected FatalError notification
at
org.apache.flink.runtime.taskmanager.TaskTest$ProhibitFatalErrorTaskManagerActions.notifyFatalError(TaskTest.java:1278)
~[test-classes/:?]
at
org.apache.flink.runtime.taskmanager.Task$TaskCancelerWatchDog.run(Task.java:1798)
~[classes/:?]
... 1 more
{code}
The {{TaskCancelerWatchDog}} causes the System.exit when the executor thread is
still alive (see
[Task:1781|https://github.com/apache/flink/blob/3b6d08e57f644cddcdac1fb5a110d44172652c3a/flink-runtime/src/main/java/org/apache/flink/runtime/taskmanager/Task.java#L1781]).
[~piotr.nowicki] [~akalashnikov] May one of you have a look at this?
> TaskTest failed fatally with exit code 239 (FatalExitExceptionHandler)
> ----------------------------------------------------------------------
>
> Key: FLINK-30844
> URL: https://issues.apache.org/jira/browse/FLINK-30844
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination, Runtime / Task
> Affects Versions: 1.17.0
> Reporter: Matthias Pohl
> Priority: Blocker
> Labels: test-stability
>
> We're experiencing a fatal crash in {{TaskTest}}:
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=45440&view=logs&j=0da23115-68bb-5dcd-192c-bd4c8adebde1&t=24c3384f-1bcb-57b3-224f-51bf973bbee8&l=8334
> {code}
> [...]
> Jan 31 01:03:12 [ERROR] Process Exit Code: 239
> Jan 31 01:03:12 [ERROR] Crashed tests:
> Jan 31 01:03:12 [ERROR] org.apache.flink.runtime.taskmanager.TaskTest
> Jan 31 01:03:12 [ERROR] at
> org.apache.maven.plugin.surefire.booterclient.ForkStarter.fork(ForkStarter.java:748)
> Jan 31 01:03:12 [ERROR] at
> org.apache.maven.plugin.surefire.booterclient.ForkStarter.access$700(ForkStarter.java:121)
> Jan 31 01:03:12 [ERROR] at
> org.apache.maven.plugin.surefire.booterclient.ForkStarter$1.call(ForkStarter.java:393)
> Jan 31 01:03:12 [ERROR] at
> org.apache.maven.plugin.surefire.booterclient.ForkStarter$1.call(ForkStarter.java:370)
> Jan 31 01:03:12 [ERROR] at
> java.util.concurrent.FutureTask.run(FutureTask.java:266)
> Jan 31 01:03:12 [ERROR] at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> Jan 31 01:03:12 [ERROR] at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> Jan 31 01:03:12 [ERROR] at java.lang.Thread.run(Thread.java:748)
> Jan 31 01:03:12 [ERROR] -> [Help 1]
> Jan 31 01:03:12 [ERROR]
> Jan 31 01:03:12 [ERROR] To see the full stack trace of the errors, re-run
> Maven with the -e switch.
> Jan 31 01:03:12 [ERROR] Re-run Maven using the -X switch to enable full debug
> logging.
> Jan 31 01:03:12 [ERROR]
> Jan 31 01:03:12 [ERROR] For more information about the errors and possible
> solutions, please read the following articles:
> Jan 31 01:03:12 [ERROR] [Help 1]
> http://cwiki.apache.org/confluence/display/MAVEN/MojoExecutionException
> Jan 31 01:03:12 [ERROR]
> Jan 31 01:03:12 [ERROR] After correcting the problems, you can resume the
> build with the command
> Jan 31 01:03:12 [ERROR] mvn <goals> -rf :flink-runtime
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)