Yun Gao created FLINK-27164: ------------------------------- Summary: ResumeCheckpointManuallyITCase.testExternalizedFSCheckpointsWithLocalRecoveryZookeeper failed on azure Key: FLINK-27164 URL: https://issues.apache.org/jira/browse/FLINK-27164 Project: Flink Issue Type: Bug Components: Runtime / Checkpointing, Runtime / Coordination Affects Versions: 1.15.0 Reporter: Yun Gao
{code:java} 2022-04-10T04:58:10.3690089Z Apr 10 04:58:10 [ERROR] Tests run: 36, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 344.79 s <<< FAILURE! - in org.apache.flink.test.checkpointing.ResumeCheckpointManuallyITCase 2022-04-10T04:58:10.3691728Z Apr 10 04:58:10 [ERROR] ResumeCheckpointManuallyITCase.testExternalizedFSCheckpointsWithLocalRecoveryZookeeper Time elapsed: 20.179 s <<< ERROR! 2022-04-10T04:58:10.3693629Z Apr 10 04:58:10 java.util.concurrent.ExecutionException: org.apache.flink.runtime.rpc.exceptions.RecipientUnreachableException: Could not send message [LocalRpcInvocation(RestfulGateway.requestJob(JobID, Time))] from sender [unknown] to recipient [akka://flink/user/rpc/dispatcher_171], because the recipient is unreachable. This can either mean that the recipient has been terminated or that the remote RpcService is currently not reachable. 2022-04-10T04:58:10.3695051Z Apr 10 04:58:10 at java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357) 2022-04-10T04:58:10.3696009Z Apr 10 04:58:10 at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1908) 2022-04-10T04:58:10.3697290Z Apr 10 04:58:10 at org.apache.flink.runtime.testutils.CommonTestUtils.lambda$waitForCheckpoint$13(CommonTestUtils.java:373) 2022-04-10T04:58:10.3698540Z Apr 10 04:58:10 at org.apache.flink.runtime.testutils.CommonTestUtils.waitUntilCondition(CommonTestUtils.java:166) 2022-04-10T04:58:10.3699709Z Apr 10 04:58:10 at org.apache.flink.runtime.testutils.CommonTestUtils.waitUntilCondition(CommonTestUtils.java:150) 2022-04-10T04:58:10.3700886Z Apr 10 04:58:10 at org.apache.flink.runtime.testutils.CommonTestUtils.waitUntilCondition(CommonTestUtils.java:142) 2022-04-10T04:58:10.3702047Z Apr 10 04:58:10 at org.apache.flink.runtime.testutils.CommonTestUtils.waitForCheckpoint(CommonTestUtils.java:371) 2022-04-10T04:58:10.3703031Z Apr 10 04:58:10 at org.apache.flink.test.checkpointing.ResumeCheckpointManuallyITCase.runJobAndGetExternalizedCheckpoint(ResumeCheckpointManuallyITCase.java:351) 2022-04-10T04:58:10.3704551Z Apr 10 04:58:10 at org.apache.flink.test.checkpointing.ResumeCheckpointManuallyITCase.testExternalizedCheckpoints(ResumeCheckpointManuallyITCase.java:312) 2022-04-10T04:58:10.3706643Z Apr 10 04:58:10 at org.apache.flink.test.checkpointing.ResumeCheckpointManuallyITCase.testExternalizedFSCheckpointsWithLocalRecoveryZookeeper(ResumeCheckpointManuallyITCase.java:247) 2022-04-10T04:58:10.3708169Z Apr 10 04:58:10 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) 2022-04-10T04:58:10.3708780Z Apr 10 04:58:10 at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 2022-04-10T04:58:10.3709498Z Apr 10 04:58:10 at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) 2022-04-10T04:58:10.3710144Z Apr 10 04:58:10 at java.lang.reflect.Method.invoke(Method.java:498) 2022-04-10T04:58:10.3710782Z Apr 10 04:58:10 at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59) 2022-04-10T04:58:10.3711494Z Apr 10 04:58:10 at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12) 2022-04-10T04:58:10.3712201Z Apr 10 04:58:10 at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56) 2022-04-10T04:58:10.3712907Z Apr 10 04:58:10 at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17) 2022-04-10T04:58:10.3713579Z Apr 10 04:58:10 at org.apache.flink.util.TestNameProvider$1.evaluate(TestNameProvider.java:45) 2022-04-10T04:58:10.3714214Z Apr 10 04:58:10 at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:61) 2022-04-10T04:58:10.3714923Z Apr 10 04:58:10 at org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306) 2022-04-10T04:58:10.3715755Z Apr 10 04:58:10 at org.junit.runners.BlockJUnit4ClassRunner$1.evaluate(BlockJUnit4ClassRunner.java:100) 2022-04-10T04:58:10.3716427Z Apr 10 04:58:10 at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:366) 2022-04-10T04:58:10.3717083Z Apr 10 04:58:10 at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:103) 2022-04-10T04:58:10.3717783Z Apr 10 04:58:10 at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:63) 2022-04-10T04:58:10.3718420Z Apr 10 04:58:10 at org.junit.runners.ParentRunner$4.run(ParentRunner.java:331) 2022-04-10T04:58:10.3719273Z Apr 10 04:58:10 at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:79) 2022-04-10T04:58:10.3719898Z Apr 10 04:58:10 at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:329) 2022-04-10T04:58:10.3720519Z Apr 10 04:58:10 at org.junit.runners.ParentRunner.access$100(ParentRunner.java:66) 2022-04-10T04:58:10.3721133Z Apr 10 04:58:10 at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:293) 2022-04-10T04:58:10.3721738Z Apr 10 04:58:10 at org.junit.runners.ParentRunner.run(ParentRunner.java:413) 2022-04-10T04:58:10.3722289Z Apr 10 04:58:10 at org.junit.runners.Suite.runChild(Suite.java:128) 2022-04-10T04:58:10.3722833Z Apr 10 04:58:10 at org.junit.runners.Suite.runChild(Suite.java:27) 2022-04-10T04:58:10.3723401Z Apr 10 04:58:10 at org.junit.runners.ParentRunner$4.run(ParentRunner.java:331) 2022-04-10T04:58:10.3724006Z Apr 10 04:58:10 at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:79) 2022-04-10T04:58:10.3724631Z Apr 10 04:58:10 at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:329) 2022-04-10T04:58:10.3725299Z Apr 10 04:58:10 at org.junit.runners.ParentRunner.access$100(ParentRunner.java:66) 2022-04-10T04:58:10.3725960Z Apr 10 04:58:10 at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:293) 2022-04-10T04:58:10.3726598Z Apr 10 04:58:10 at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:54) 2022-04-10T04:58:10.3727199Z Apr 10 04:58:10 at org.junit.rules.RunRules.evaluate(RunRules.java:20) 2022-04-10T04:58:10.3727789Z Apr 10 04:58:10 at org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306) 2022-04-10T04:58:10.3728393Z Apr 10 04:58:10 at org.junit.runners.ParentRunner.run(ParentRunner.java:413) 2022-04-10T04:58:10.3728960Z Apr 10 04:58:10 at org.junit.runner.JUnitCore.run(JUnitCore.java:137) 2022-04-10T04:58:10.3729501Z Apr 10 04:58:10 at org.junit.runner.JUnitCore.run(JUnitCore.java:115) 2022-04-10T04:58:10.3730133Z Apr 10 04:58:10 at org.junit.vintage.engine.execution.RunnerExecutor.execute(RunnerExecutor.java:42) 2022-04-10T04:58:10.3730854Z Apr 10 04:58:10 at org.junit.vintage.engine.VintageTestEngine.executeAllChildren(VintageTestEngine.java:80) 2022-04-10T04:58:10.3731558Z Apr 10 04:58:10 at org.junit.vintage.engine.VintageTestEngine.execute(VintageTestEngine.java:72) 2022-04-10T04:58:10.3732307Z Apr 10 04:58:10 at org.junit.platform.launcher.core.EngineExecutionOrchestrator.execute(EngineExecutionOrchestrator.java:107) 2022-04-10T04:58:10.3733127Z Apr 10 04:58:10 at org.junit.platform.launcher.core.EngineExecutionOrchestrator.execute(EngineExecutionOrchestrator.java:88) 2022-04-10T04:58:10.3733964Z Apr 10 04:58:10 at org.junit.platform.launcher.core.EngineExecutionOrchestrator.lambda$execute$0(EngineExecutionOrchestrator.java:54) 2022-04-10T04:58:10.3735182Z Apr 10 04:58:10 at org.junit.platform.launcher.core.EngineExecutionOrchestrator.withInterceptedStreams(EngineExecutionOrchestrator.java:67) 2022-04-10T04:58:10.3736678Z Apr 10 04:58:10 at org.junit.platform.launcher.core.EngineExecutionOrchestrator.execute(EngineExecutionOrchestrator.java:52) 2022-04-10T04:58:10.3737494Z Apr 10 04:58:10 at org.junit.platform.launcher.core.DefaultLauncher.execute(DefaultLauncher.java:114) 2022-04-10T04:58:10.3738205Z Apr 10 04:58:10 at org.junit.platform.launcher.core.DefaultLauncher.execute(DefaultLauncher.java:86) 2022-04-10T04:58:10.3738983Z Apr 10 04:58:10 at org.junit.platform.launcher.core.DefaultLauncherSession$DelegatingLauncher.execute(DefaultLauncherSession.java:86) 2022-04-10T04:58:10.3739817Z Apr 10 04:58:10 at org.junit.platform.launcher.core.SessionPerRequestLauncher.execute(SessionPerRequestLauncher.java:53) 2022-04-10T04:58:10.3740614Z Apr 10 04:58:10 at org.apache.maven.surefire.junitplatform.JUnitPlatformProvider.execute(JUnitPlatformProvider.java:188) 2022-04-10T04:58:10.3741429Z Apr 10 04:58:10 at org.apache.maven.surefire.junitplatform.JUnitPlatformProvider.invokeAllTests(JUnitPlatformProvider.java:154) 2022-04-10T04:58:10.3742426Z Apr 10 04:58:10 at org.apache.maven.surefire.junitplatform.JUnitPlatformProvider.invoke(JUnitPlatformProvider.java:124) 2022-04-10T04:58:10.3743276Z Apr 10 04:58:10 at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:428) 2022-04-10T04:58:10.3743987Z Apr 10 04:58:10 at org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:162) 2022-04-10T04:58:10.3744653Z Apr 10 04:58:10 at org.apache.maven.surefire.booter.ForkedBooter.run(ForkedBooter.java:562) 2022-04-10T04:58:10.3745444Z Apr 10 04:58:10 at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:548) 2022-04-10T04:58:10.3746780Z Apr 10 04:58:10 Caused by: org.apache.flink.runtime.rpc.exceptions.RecipientUnreachableException: Could not send message [LocalRpcInvocation(RestfulGateway.requestJob(JobID, Time))] from sender [unknown] to recipient [akka://flink/user/rpc/dispatcher_171], because the recipient is unreachable. This can either mean that the recipient has been terminated or that the remote RpcService is currently not reachable. 2022-04-10T04:58:10.3747926Z Apr 10 04:58:10 at com.sun.proxy.$Proxy35.requestJob(Unknown Source) 2022-04-10T04:58:10.3748573Z Apr 10 04:58:10 at org.apache.flink.runtime.minicluster.MiniCluster.lambda$getExecutionGraph$18(MiniCluster.java:878) 2022-04-10T04:58:10.3749315Z Apr 10 04:58:10 at java.util.concurrent.CompletableFuture.uniApply(CompletableFuture.java:616) 2022-04-10T04:58:10.3750002Z Apr 10 04:58:10 at java.util.concurrent.CompletableFuture.uniApplyStage(CompletableFuture.java:628) 2022-04-10T04:58:10.3750679Z Apr 10 04:58:10 at java.util.concurrent.CompletableFuture.thenApply(CompletableFuture.java:1996) 2022-04-10T04:58:10.3751389Z Apr 10 04:58:10 at org.apache.flink.runtime.minicluster.MiniCluster.runDispatcherCommand(MiniCluster.java:900) 2022-04-10T04:58:10.3752124Z Apr 10 04:58:10 at org.apache.flink.runtime.minicluster.MiniCluster.getExecutionGraph(MiniCluster.java:877) 2022-04-10T04:58:10.3752660Z Apr 10 04:58:10 ... 61 more 2022-04-10T04:58:10.3753395Z Apr 10 04:58:10 Caused by: akka.pattern.AskTimeoutException: Recipient [Actor[akka://flink/user/rpc/dispatcher_171#910550752]] had already been terminated. Message of type [org.apache.flink.runtime.rpc.messages.LocalFencedMessage]. {code} https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=34485&view=logs&j=a549b384-c55a-52c0-c451-00e0477ab6db&t=eef5922c-08d9-5ba3-7299-8393476594e7&l=12918 -- This message was sent by Atlassian Jira (v8.20.1#820001)