[
https://issues.apache.org/jira/browse/FLINK-34425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17816971#comment-17816971
]
Matthias Pohl edited comment on FLINK-34425 at 2/13/24 11:48 AM:
-----------------------------------------------------------------
This looks like a test issue. The TaskManager process is destroyed in
[TaskManagerRunnerITCase:124|https://github.com/apache/flink/blob/d6c7eee8243b4fe3e593698f250643534dc79cb5/flink-tests/src/test/java/org/apache/flink/test/recovery/TaskManagerRunnerITCase.java#L124]
but doesn't get back properly causing the timeout in {{#waitFor()}} in
[TaskManagerRunnerITCase:126|https://github.com/apache/flink/blob/d6c7eee8243b4fe3e593698f250643534dc79cb5/flink-tests/src/test/java/org/apache/flink/test/recovery/TaskManagerRunnerITCase.java#L126].
I'm gonna lower this issue's priority to {{Major}}. I don't consider it in any
way problematic for the upcoming 1.19 release.
was (Author: mapohl):
This looks like a test issue. The TaskManager process is destroyed in
[TaskManagerRunnerITCase:124|https://github.com/apache/flink/blob/d6c7eee8243b4fe3e593698f250643534dc79cb5/flink-tests/src/test/java/org/apache/flink/test/recovery/TaskManagerRunnerITCase.java#L124]
but doesn't get back properly causing the timeout in {{#waitFor()}} in
[TaskManagerRunnerITCase:126|https://github.com/apache/flink/blob/d6c7eee8243b4fe3e593698f250643534dc79cb5/flink-tests/src/test/java/org/apache/flink/test/recovery/TaskManagerRunnerITCase.java#L126].
> TaskManagerRunnerITCase#testNondeterministicWorkingDirIsDeletedInCaseOfProcessFailure
> times out
> -----------------------------------------------------------------------------------------------
>
> Key: FLINK-34425
> URL: https://issues.apache.org/jira/browse/FLINK-34425
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination
> Affects Versions: 1.19.0, 1.20.0
> Reporter: Matthias Pohl
> Assignee: Matthias Pohl
> Priority: Critical
> Labels: test-stability
>
> https://github.com/apache/flink/actions/runs/7851900616/job/21429757962#step:10:8844
> {code}
> Feb 10 03:21:45 "main" #1 [498632] prio=5 os_prio=0 cpu=619.91ms
> elapsed=1653.40s tid=0x00007fbd29695000 nid=498632 waiting on condition
> [0x00007fbd2b9f3000]
> Feb 10 03:21:45 java.lang.Thread.State: WAITING (parking)
> Feb 10 03:21:45 at
> jdk.internal.misc.Unsafe.park([email protected]/Native Method)
> Feb 10 03:21:45 - parking to wait for <0x00000000ae6199f0> (a
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
> Feb 10 03:21:45 at
> java.util.concurrent.locks.LockSupport.park([email protected]/LockSupport.java:371)
> Feb 10 03:21:45 at
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionNode.block([email protected]/AbstractQueuedSynchronizer.java:519)
> Feb 10 03:21:45 at
> java.util.concurrent.ForkJoinPool.unmanagedBlock([email protected]/ForkJoinPool.java:3780)
> Feb 10 03:21:45 at
> java.util.concurrent.ForkJoinPool.managedBlock([email protected]/ForkJoinPool.java:3725)
> Feb 10 03:21:45 at
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await([email protected]/AbstractQueuedSynchronizer.java:1707)
> Feb 10 03:21:45 at
> java.lang.ProcessImpl.waitFor([email protected]/ProcessImpl.java:425)
> Feb 10 03:21:45 at
> org.apache.flink.test.recovery.TaskManagerRunnerITCase.testNondeterministicWorkingDirIsDeletedInCaseOfProcessFailure(TaskManagerRunnerITCase.java:126)
> Feb 10 03:21:45 at
> java.lang.invoke.LambdaForm$DMH/0x00007fbccb1b8000.invokeVirtual([email protected]/LambdaForm$DMH)
> Feb 10 03:21:45 at
> java.lang.invoke.LambdaForm$MH/0x00007fbccb1b8800.invoke([email protected]/LambdaForm$MH)
> [...]
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)