[ 
https://issues.apache.org/jira/browse/FLINK-30844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17682532#comment-17682532
 ] 

Matthias Pohl commented on FLINK-30844:
---------------------------------------

{{TaskTest.testInterruptibleSharedLockInInvokeAndCancel}} caused the failure:
{code}
00:59:02,291 [Cancellation Watchdog for Test Task (1/1)#0 (003bbd51a0b61b0ff2925c31e749f53e_00000000000000000000000000000000_0_0).] ERROR org.apache.flink.util.FatalExitExceptionHandler              [] - FATAL: Thread 'Cancellation Watchdog for Test Task (1/1)#0 (003bbd51a0b61b0ff2925c31e749f53e_00000000000000000000000000000000_0_0).' produced an uncaught exception. Stopping the process...
org.apache.flink.util.FlinkRuntimeException: Error in Task Cancellation Watch Dog
        at org.apache.flink.runtime.taskmanager.Task$TaskCancelerWatchDog.run(Task.java:1801) ~[classes/:?]
        at java.lang.Thread.run(Thread.java:748) [?:1.8.0_292]
Caused by: java.lang.RuntimeException: Unexpected FatalError notification
        at org.apache.flink.runtime.taskmanager.TaskTest$ProhibitFatalErrorTaskManagerActions.notifyFatalError(TaskTest.java:1278) ~[test-classes/:?]
        at org.apache.flink.runtime.taskmanager.Task$TaskCancelerWatchDog.run(Task.java:1798) ~[classes/:?]
        ... 1 more
{code}

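The {{TaskCancelerWatchDog}} triggers the {{System.exit}} when the executor thread is 
still alive (see 
[Task:1781|https://github.com/apache/flink/blob/3b6d08e57f644cddcdac1fb5a110d44172652c3a/flink-runtime/src/main/java/org/apache/flink/runtime/taskmanager/Task.java#L1781]). 
According to the stack trace, it calls {{notifyFatalError}}, which the test's 
{{ProhibitFatalErrorTaskManagerActions}} answers with a {{RuntimeException}}. The 
watchdog thread then dies with that uncaught exception, and the 
{{FatalExitExceptionHandler}} stops the process.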
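
To illustrate the mechanism the stack trace points to, here is a minimal standalone sketch. It is not Flink's code: {{WatchdogSketch}}, {{FatalErrorListener}} and {{runScenario}} are hypothetical names, the timeout values are arbitrary, and the uncaught-exception handler only records the exception instead of calling {{System.exit}} as the real handler does.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.atomic.AtomicReference;

class WatchdogSketch {

    /** Hypothetical stand-in for the callback a watchdog uses to report a fatal error. */
    interface FatalErrorListener {
        void notifyFatalError(String message);
    }

    /**
     * Runs an executor thread plus a watchdog that interrupts it and waits up to
     * timeoutMillis (must be > 0) for it to terminate.
     *
     * @return the exception that escaped the watchdog thread, or null if there was none
     */
    static Throwable runScenario(boolean executorReactsToInterrupt, long timeoutMillis)
            throws InterruptedException {
        CountDownLatch release = new CountDownLatch(1);

        Thread executor = new Thread(() -> {
            while (true) {
                try {
                    release.await(); // blocks until the scenario is over
                    return;
                } catch (InterruptedException e) {
                    if (executorReactsToInterrupt) {
                        return; // well-behaved task: terminates on interrupt
                    }
                    // stuck task: swallows the interrupt and keeps blocking
                }
            }
        }, "executor");

        // Mimics a test double that must never see a fatal error.
        FatalErrorListener prohibiting = message -> {
            throw new RuntimeException("Unexpected FatalError notification");
        };

        AtomicReference<Throwable> uncaught = new AtomicReference<>();
        Thread watchdog = new Thread(() -> {
            try {
                executor.interrupt();
                executor.join(timeoutMillis);
                if (executor.isAlive()) {
                    prohibiting.notifyFatalError("Task did not exit gracefully");
                }
            } catch (Throwable t) {
                // The wrapped exception leaves run() and reaches the uncaught handler.
                throw new RuntimeException("Error in watchdog", t);
            }
        }, "watchdog");

        // A fatal-exit handler would stop the JVM here; the sketch only records.
        watchdog.setUncaughtExceptionHandler((thread, error) -> uncaught.set(error));

        executor.start();
        watchdog.start();
        watchdog.join();
        release.countDown(); // let a stuck executor finish so the JVM can exit
        executor.join();
        return uncaught.get();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("stuck executor:    " + runScenario(false, 100));
        System.out.println("reacting executor: " + runScenario(true, 2000));
    }
}
```

In the sketch, only the executor that is still alive when the timeout expires leads to an exception escaping the watchdog thread. This suggests the test's executor thread did not terminate in time, although the sketch cannot say why.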

[~piotr.nowicki] [~akalashnikov] Could one of you have a look at this?

> TaskTest failed fatally with exit code 239 (FatalExitExceptionHandler)
> ----------------------------------------------------------------------
>
>                 Key: FLINK-30844
>                 URL: https://issues.apache.org/jira/browse/FLINK-30844
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Coordination, Runtime / Task
>    Affects Versions: 1.17.0
>            Reporter: Matthias Pohl
>            Priority: Blocker
>              Labels: test-stability
>
> We're experiencing a fatal crash in {{TaskTest}}:
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=45440&view=logs&j=0da23115-68bb-5dcd-192c-bd4c8adebde1&t=24c3384f-1bcb-57b3-224f-51bf973bbee8&l=8334
> {code}
> [...]
> Jan 31 01:03:12 [ERROR] Process Exit Code: 239
> Jan 31 01:03:12 [ERROR] Crashed tests:
> Jan 31 01:03:12 [ERROR] org.apache.flink.runtime.taskmanager.TaskTest
> Jan 31 01:03:12 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter.fork(ForkStarter.java:748)
> Jan 31 01:03:12 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter.access$700(ForkStarter.java:121)
> Jan 31 01:03:12 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter$1.call(ForkStarter.java:393)
> Jan 31 01:03:12 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter$1.call(ForkStarter.java:370)
> Jan 31 01:03:12 [ERROR] at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> Jan 31 01:03:12 [ERROR] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> Jan 31 01:03:12 [ERROR] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> Jan 31 01:03:12 [ERROR] at java.lang.Thread.run(Thread.java:748)
> Jan 31 01:03:12 [ERROR] -> [Help 1]
> Jan 31 01:03:12 [ERROR] 
> Jan 31 01:03:12 [ERROR] To see the full stack trace of the errors, re-run Maven with the -e switch.
> Jan 31 01:03:12 [ERROR] Re-run Maven using the -X switch to enable full debug logging.
> Jan 31 01:03:12 [ERROR] 
> Jan 31 01:03:12 [ERROR] For more information about the errors and possible solutions, please read the following articles:
> Jan 31 01:03:12 [ERROR] [Help 1] http://cwiki.apache.org/confluence/display/MAVEN/MojoExecutionException
> Jan 31 01:03:12 [ERROR] 
> Jan 31 01:03:12 [ERROR] After correcting the problems, you can resume the build with the command
> Jan 31 01:03:12 [ERROR]   mvn <goals> -rf :flink-runtime
> {code}



