[
https://issues.apache.org/jira/browse/FLINK-32972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17787848#comment-17787848
]
Matthias Pohl commented on FLINK-32972:
---------------------------------------
1.17:
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=54681&view=logs&j=4d4a0d10-fca2-5507-8eed-c07f0bdf4887&t=7b25afdf-cc6c-566f-5459-359dc2585798&l=8750
> TaskTest.testInterruptibleSharedLockInInvokeAndCancel causes a JVM shutdown
> with exit code 239
> ----------------------------------------------------------------------------------------------
>
> Key: FLINK-32972
> URL: https://issues.apache.org/jira/browse/FLINK-32972
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination
> Affects Versions: 1.17.2
> Reporter: Sergey Nuyanzin
> Priority: Major
> Labels: test-stability
>
> Within this build
> [https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=52668&view=logs&j=b0a398c0-685b-599c-eb57-c8c2a771138e&t=747432ad-a576-5911-1e2a-68c6bedc248a&l=8677]
> it looks like task
> {{1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0}} was
> canceled
> {noformat}
> ================================================================================
> Test
> testInterruptibleSharedLockInInvokeAndCancel(org.apache.flink.runtime.taskmanager.TaskTest)
> is running.
> --------------------------------------------------------------------------------
> 01:30:05,140 [ main] INFO
> org.apache.flink.runtime.io.network.NettyShuffleServiceFactory [] - Created a
> new FileChannelManager for storing result partitions of BLOCKING shuffles.
> Used directories:
> /tmp/flink-netty-shuffle-82415974-782a-46db-afbc-8f18f30a4ec5
> 01:30:05,177 [ main] INFO
> org.apache.flink.runtime.io.network.buffer.NetworkBufferPool [] - Allocated
> 32 MB for network buffer pool (number of memory segments: 1024, bytes per
> segment: 32768).
> 01:30:05,181 [ Test Task (1/1)#0] INFO
> org.apache.flink.runtime.taskmanager.Task [] - Test Task
> (1/1)#0
> (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0)
> switched from CREATED to DEPLOYING.
> 01:30:05,190 [ Test Task (1/1)#0] INFO
> org.apache.flink.runtime.taskmanager.Task [] - Loading JAR
> files for task Test Task (1/1)#0
> (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0)
> [DEPLOYING].
> 01:30:05,192 [ Test Task (1/1)#0] INFO
> org.apache.flink.runtime.taskmanager.Task [] - Test Task
> (1/1)#0
> (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0)
> switched from DEPLOYING to INITIALIZING.
> 01:30:05,192 [ Test Task (1/1)#0] INFO
> org.apache.flink.runtime.taskmanager.Task [] - Test Task
> (1/1)#0
> (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0)
> switched from INITIALIZING to RUNNING.
> 01:30:05,195 [ main] INFO
> org.apache.flink.runtime.taskmanager.Task [] - Attempting
> to cancel task Test Task (1/1)#0
> (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0).
> 01:30:05,196 [ main] INFO
> org.apache.flink.runtime.taskmanager.Task [] - Test Task
> (1/1)#0
> (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0)
> switched from RUNNING to CANCELING.
> 01:30:05,196 [ main] INFO
> org.apache.flink.runtime.taskmanager.Task [] - Triggering
> cancellation of task code Test Task (1/1)#0
> (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0).
> 01:30:05,197 [ Test Task (1/1)#0] INFO
> org.apache.flink.runtime.taskmanager.Task [] - Test Task
> (1/1)#0
> (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0)
> switched from CANCELING to CANCELED.
> 01:30:05,198 [ Test Task (1/1)#0] INFO
> org.apache.flink.runtime.taskmanager.Task [] - Freeing
> task resources for Test Task (1/1)#0
> (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0).
> {noformat}
> and after that there are records in logs complaining htat task did not react
> {noformat}
> 01:30:05,337 [Canceler/Interrupts for Test Task (1/1)#0
> (1ec32305eb0f926acae926007429c142_00000000000000000000000000000000_0_0).]
> WARN org.apache.flink.runtime.taskmanager.Task [] - Task
> 'Test Task (1/1)#0' did not react to cancelling signal - interrupting; it is
> stuck for 0 seconds in method:
>
> app//org.apache.flink.runtime.metrics.groups.AbstractMetricGroup.close(AbstractMetricGroup.java:322)
> app//org.apache.flink.runtime.metrics.groups.AbstractMetricGroup.close(AbstractMetricGroup.java:327)
> app//org.apache.flink.runtime.metrics.groups.AbstractMetricGroup.close(AbstractMetricGroup.java:327)
> app//org.apache.flink.runtime.metrics.groups.AbstractMetricGroup.close(AbstractMetricGroup.java:327)
> app//org.apache.flink.runtime.metrics.groups.AbstractMetricGroup.close(AbstractMetricGroup.java:327)
> app//org.apache.flink.runtime.metrics.groups.ComponentMetricGroup.close(ComponentMetricGroup.java:62)
> app//org.apache.flink.runtime.metrics.groups.TaskMetricGroup.close(TaskMetricGroup.java:179)
> app//org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:866)
> app//org.apache.flink.runtime.taskmanager.Task.run(Task.java:562)
> [email protected]/java.lang.Thread.run(Thread.java:829)
> {noformat}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)