One small addition: The old mapping looks to use the SubtaskStateMapper.RANGE whereas the new mapping looks to use the SubtaskStateMapper.ROUND_ROBIN.
On Mon, May 17, 2021 at 11:56 AM Till Rohrmann <trohrm...@apache.org> wrote: > Hi ChangZhuo Chen, > > This looks like a bug in Flink. Could you provide us with the logs of the > run and more information about your job? In particular, how does your > topology look like? > > My suspicion is the following: You have an operator with two inputs. One > input is keyed whereas the other input is something else. Due to this > property, the JobVertex has two different SubtaskStateMappers assigned > which produce different subtask mappings in the TaskStateAssignment. This > is the exception you are observing. The initial deployment works because > you don't have any state to recover. However, subsequent recoveries should > fail. I am also pulling in Arvid who worked on the subtask state assignment > recently and might be able to shed some more light on this matter. > > Cheers, > Till > > On Fri, May 14, 2021 at 4:35 AM ChangZhuo Chen (陳昌倬) <czc...@czchen.org> > wrote: > >> Hi, >> >> We run our application in Flink 1.13.0, Kubernetes standalone >> application cluster with reactive mode enabled. The application has >> stopped and cannot restore today, so we try to restore the application >> from checkpoint. However, the application cannot restart from checkpoint >> due to the following error. We have no idea the meaning of this >> exception, so any help is welcome. >> >> >> 2021-05-14 01:55:37,204 INFO >> org.apache.flink.runtime.jobmaster.JobMaster [] - Close >> ResourceManager connection 06d772aae2ab4afb8c6917dac40cd727: Stopping >> JobMaster for job >> rt-match_11.2.16_5d671ba3(00000000000000000000000000000000).. >> 2021-05-14 01:55:37,205 INFO >> org.apache.flink.runtime.leaderretrieval.DefaultLeaderRetrievalService [] - >> Stopping DefaultLeaderRetrievalService. >> 2021-05-14 01:55:37,205 INFO >> org.apache.flink.kubernetes.highavailability.KubernetesLeaderRetrievalDriver >> [] - Stopping >> KubernetesLeaderRetrievalDriver{configMapName='rt-match-flink-dev-resourcemanager-leader'}. >> 2021-05-14 01:55:37,205 INFO >> org.apache.flink.kubernetes.kubeclient.resources.KubernetesConfigMapWatcher >> [] - The watcher is closing. >> 2021-05-14 01:55:37,215 INFO >> org.apache.flink.runtime.leaderelection.DefaultLeaderElectionService [] - >> Stopping DefaultLeaderElectionService. >> 2021-05-14 01:55:37,215 INFO >> org.apache.flink.kubernetes.highavailability.KubernetesLeaderElectionDriver >> [] - Closing >> KubernetesLeaderElectionDriver{configMapName='rt-match-flink-dev-00000000000000000000000000000000-jobmanager-leader'}. >> 2021-05-14 01:55:37,216 INFO >> org.apache.flink.kubernetes.kubeclient.resources.KubernetesConfigMapWatcher >> [] - The watcher is closing. >> 2021-05-14 01:55:37,342 INFO >> org.apache.flink.runtime.jobmanager.DefaultJobGraphStore [] - Removed >> job graph 00000000000000000000000000000000 from >> KubernetesStateHandleStore{configMapName='rt-match-flink-dev-dispatcher-leader'}. >> 2021-05-14 01:55:37,914 INFO >> org.apache.flink.client.deployment.application.ApplicationDispatcherBootstrap >> [] - Application FAILED: >> java.util.concurrent.CompletionException: >> org.apache.flink.client.deployment.application.UnsuccessfulExecutionException: >> Application Status: FAILED >> at >> org.apache.flink.client.deployment.application.ApplicationDispatcherBootstrap.lambda$unwrapJobResultException$4(ApplicationDispatcherBootstrap.java:304) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> java.util.concurrent.CompletableFuture.uniApplyNow(CompletableFuture.java:680) >> ~[?:?] >> at >> java.util.concurrent.CompletableFuture.uniApplyStage(CompletableFuture.java:658) >> ~[?:?] >> at >> java.util.concurrent.CompletableFuture.thenApply(CompletableFuture.java:2094) >> ~[?:?] >> at >> org.apache.flink.client.deployment.application.ApplicationDispatcherBootstrap.unwrapJobResultException(ApplicationDispatcherBootstrap.java:297) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.client.deployment.application.ApplicationDispatcherBootstrap.lambda$getApplicationResult$3(ApplicationDispatcherBootstrap.java:270) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:195) >> ~[?:?] >> at >> java.util.ArrayList$ArrayListSpliterator.forEachRemaining(ArrayList.java:1655) >> ~[?:?] >> at >> java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:484) ~[?:?] >> at >> java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:474) >> ~[?:?] >> at >> java.util.stream.ReduceOps$ReduceOp.evaluateSequential(ReduceOps.java:913) >> ~[?:?] >> at >> java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234) ~[?:?] >> at >> java.util.stream.ReferencePipeline.collect(ReferencePipeline.java:578) >> ~[?:?] >> at >> org.apache.flink.client.deployment.application.ApplicationDispatcherBootstrap.getApplicationResult(ApplicationDispatcherBootstrap.java:272) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.client.deployment.application.ApplicationDispatcherBootstrap.lambda$runApplicationAsync$2(ApplicationDispatcherBootstrap.java:221) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> java.util.concurrent.CompletableFuture$UniCompose.tryFire(CompletableFuture.java:1072) >> ~[?:?] >> at >> java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506) >> ~[?:?] >> at >> java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2073) >> ~[?:?] >> at >> org.apache.flink.client.deployment.application.ApplicationDispatcherBootstrap.runApplicationEntryPoint(ApplicationDispatcherBootstrap.java:254) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.client.deployment.application.ApplicationDispatcherBootstrap.lambda$runApplicationAsync$1(ApplicationDispatcherBootstrap.java:212) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) >> [?:?] >> at java.util.concurrent.FutureTask.run(FutureTask.java:264) [?:?] >> at >> org.apache.flink.runtime.concurrent.akka.ActorSystemScheduledExecutorAdapter$ScheduledFutureTask.run(ActorSystemScheduledExecutorAdapter.java:159) >> [flink-dist_2.12-1.13.0.jar:1.13.0] >> at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:40) >> [flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(ForkJoinExecutorConfigurator.scala:44) >> [flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) >> [flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) >> [flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) >> [flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107) >> [flink-dist_2.12-1.13.0.jar:1.13.0] >> Caused by: >> org.apache.flink.client.deployment.application.UnsuccessfulExecutionException: >> Application Status: FAILED >> at >> org.apache.flink.client.deployment.application.UnsuccessfulExecutionException.fromJobResult(UnsuccessfulExecutionException.java:71) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> ... 29 more >> Caused by: org.apache.flink.runtime.client.JobExecutionException: Job >> execution failed. >> at >> org.apache.flink.runtime.jobmaster.JobResult.toJobExecutionResult(JobResult.java:144) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.client.deployment.application.UnsuccessfulExecutionException.fromJobResult(UnsuccessfulExecutionException.java:60) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> ... 29 more >> Caused by: java.util.concurrent.CompletionException: >> java.util.concurrent.CompletionException: java.lang.IllegalStateException: >> Incompatible subtask mappings: are multiple operators ingesting/producing >> intermediate results with varying degrees of parallelism?Found >> RescaleMappings{mappings=[[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, >> 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], [30, 31, >> 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, >> 51, 52, 53, 54, 55, 56, 57, 58, 59], [60, 61, 62, 63, 64, 65, 66, 67, 68, >> 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, >> 88, 89], [90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, >> 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119], >> [120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, >> 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149], >> [150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, >> 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179], >> [180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, >> 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, >> 209]]} and RescaleMappings{mappings=[[0, 7, 14, 21, 28, 35, 42, 49, 56, 63, >> 70, 77, 84, 91, 98, 105, 112, 119, 126, 133, 140, 147, 154, 161, 168, 175, >> 182, 189, 196, 203], [1, 8, 15, 22, 29, 36, 43, 50, 57, 64, 71, 78, 85, 92, >> 99, 106, 113, 120, 127, 134, 141, 148, 155, 162, 169, 176, 183, 190, 197, >> 204], [2, 9, 16, 23, 30, 37, 44, 51, 58, 65, 72, 79, 86, 93, 100, 107, 114, >> 121, 128, 135, 142, 149, 156, 163, 170, 177, 184, 191, 198, 205], [3, 10, >> 17, 24, 31, 38, 45, 52, 59, 66, 73, 80, 87, 94, 101, 108, 115, 122, 129, >> 136, 143, 150, 157, 164, 171, 178, 185, 192, 199, 206], [4, 11, 18, 25, 32, >> 39, 46, 53, 60, 67, 74, 81, 88, 95, 102, 109, 116, 123, 130, 137, 144, 151, >> 158, 165, 172, 179, 186, 193, 200, 207], [5, 12, 19, 26, 33, 40, 47, 54, >> 61, 68, 75, 82, 89, 96, 103, 110, 117, 124, 131, 138, 145, 152, 159, 166, >> 173, 180, 187, 194, 201, 208], [6, 13, 20, 27, 34, 41, 48, 55, 62, 69, 76, >> 83, 90, 97, 104, 111, 118, 125, 132, 139, 146, 153, 160, 167, 174, 181, >> 188, 195, 202, 209]]}. >> at >> org.apache.flink.runtime.concurrent.FutureUtils.lambda$switchExecutor$24(FutureUtils.java:1377) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:930) >> ~[?:?] >> at >> java.util.concurrent.CompletableFuture$UniHandle.tryFire(CompletableFuture.java:907) >> ~[?:?] >> at >> java.util.concurrent.CompletableFuture$Completion.run(CompletableFuture.java:478) >> ~[?:?] >> at >> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRunAsync(AkkaRpcActor.java:440) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:208) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:77) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(AkkaRpcActor.java:158) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at scala.PartialFunction.applyOrElse(PartialFunction.scala:123) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at scala.PartialFunction.applyOrElse$(PartialFunction.scala:122) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at akka.actor.Actor.aroundReceive(Actor.scala:517) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at akka.actor.Actor.aroundReceive$(Actor.scala:515) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at akka.actor.ActorCell.invoke(ActorCell.scala:561) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at akka.dispatch.Mailbox.run(Mailbox.scala:225) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at akka.dispatch.Mailbox.exec(Mailbox.scala:235) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> ... 4 more >> Caused by: java.util.concurrent.CompletionException: >> java.lang.IllegalStateException: Incompatible subtask mappings: are >> multiple operators ingesting/producing intermediate results with varying >> degrees of parallelism?Found RescaleMappings{mappings=[[0, 1, 2, 3, 4, 5, >> 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, >> 26, 27, 28, 29], [30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, >> 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59], [60, 61, >> 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, >> 81, 82, 83, 84, 85, 86, 87, 88, 89], [90, 91, 92, 93, 94, 95, 96, 97, 98, >> 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, >> 114, 115, 116, 117, 118, 119], [120, 121, 122, 123, 124, 125, 126, 127, >> 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, >> 143, 144, 145, 146, 147, 148, 149], [150, 151, 152, 153, 154, 155, 156, >> 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, >> 172, 173, 174, 175, 176, 177, 178, 179], [180, 181, 182, 183, 184, 185, >> 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, >> 201, 202, 203, 204, 205, 206, 207, 208, 209]]} and >> RescaleMappings{mappings=[[0, 7, 14, 21, 28, 35, 42, 49, 56, 63, 70, 77, >> 84, 91, 98, 105, 112, 119, 126, 133, 140, 147, 154, 161, 168, 175, 182, >> 189, 196, 203], [1, 8, 15, 22, 29, 36, 43, 50, 57, 64, 71, 78, 85, 92, 99, >> 106, 113, 120, 127, 134, 141, 148, 155, 162, 169, 176, 183, 190, 197, 204], >> [2, 9, 16, 23, 30, 37, 44, 51, 58, 65, 72, 79, 86, 93, 100, 107, 114, 121, >> 128, 135, 142, 149, 156, 163, 170, 177, 184, 191, 198, 205], [3, 10, 17, >> 24, 31, 38, 45, 52, 59, 66, 73, 80, 87, 94, 101, 108, 115, 122, 129, 136, >> 143, 150, 157, 164, 171, 178, 185, 192, 199, 206], [4, 11, 18, 25, 32, 39, >> 46, 53, 60, 67, 74, 81, 88, 95, 102, 109, 116, 123, 130, 137, 144, 151, >> 158, 165, 172, 179, 186, 193, 200, 207], [5, 12, 19, 26, 33, 40, 47, 54, >> 61, 68, 75, 82, 89, 96, 103, 110, 117, 124, 131, 138, 145, 152, 159, 166, >> 173, 180, 187, 194, 201, 208], [6, 13, 20, 27, 34, 41, 48, 55, 62, 69, 76, >> 83, 90, 97, 104, 111, 118, 125, 132, 139, 146, 153, 160, 167, 174, 181, >> 188, 195, 202, 209]]}. >> at >> org.apache.flink.runtime.scheduler.adaptive.BackgroundTask.lambda$new$0(BackgroundTask.java:59) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:642) >> ~[?:?] >> at >> java.util.concurrent.CompletableFuture$Completion.run(CompletableFuture.java:478) >> ~[?:?] >> at >> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) >> ~[?:?] >> at java.util.concurrent.FutureTask.run(FutureTask.java:264) ~[?:?] >> at >> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) >> ~[?:?] >> at >> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) >> ~[?:?] >> at >> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) >> ~[?:?] >> at java.lang.Thread.run(Thread.java:834) ~[?:?] >> Caused by: java.lang.IllegalStateException: Incompatible subtask >> mappings: are multiple operators ingesting/producing intermediate results >> with varying degrees of parallelism?Found RescaleMappings{mappings=[[0, 1, >> 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, >> 23, 24, 25, 26, 27, 28, 29], [30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, >> 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, >> 59], [60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, >> 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89], [90, 91, 92, 93, 94, >> 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, >> 111, 112, 113, 114, 115, 116, 117, 118, 119], [120, 121, 122, 123, 124, >> 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, >> 140, 141, 142, 143, 144, 145, 146, 147, 148, 149], [150, 151, 152, 153, >> 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, >> 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179], [180, 181, 182, >> 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, >> 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209]]} and >> RescaleMappings{mappings=[[0, 7, 14, 21, 28, 35, 42, 49, 56, 63, 70, 77, >> 84, 91, 98, 105, 112, 119, 126, 133, 140, 147, 154, 161, 168, 175, 182, >> 189, 196, 203], [1, 8, 15, 22, 29, 36, 43, 50, 57, 64, 71, 78, 85, 92, 99, >> 106, 113, 120, 127, 134, 141, 148, 155, 162, 169, 176, 183, 190, 197, 204], >> [2, 9, 16, 23, 30, 37, 44, 51, 58, 65, 72, 79, 86, 93, 100, 107, 114, 121, >> 128, 135, 142, 149, 156, 163, 170, 177, 184, 191, 198, 205], [3, 10, 17, >> 24, 31, 38, 45, 52, 59, 66, 73, 80, 87, 94, 101, 108, 115, 122, 129, 136, >> 143, 150, 157, 164, 171, 178, 185, 192, 199, 206], [4, 11, 18, 25, 32, 39, >> 46, 53, 60, 67, 74, 81, 88, 95, 102, 109, 116, 123, 130, 137, 144, 151, >> 158, 165, 172, 179, 186, 193, 200, 207], [5, 12, 19, 26, 33, 40, 47, 54, >> 61, 68, 75, 82, 89, 96, 103, 110, 117, 124, 131, 138, 145, 152, 159, 166, >> 173, 180, 187, 194, 201, 208], [6, 13, 20, 27, 34, 41, 48, 55, 62, 69, 76, >> 83, 90, 97, 104, 111, 118, 125, 132, 139, 146, 153, 160, 167, 174, 181, >> 188, 195, 202, 209]]}. >> at >> org.apache.flink.runtime.checkpoint.TaskStateAssignment.checkSubtaskMapping(TaskStateAssignment.java:322) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.checkpoint.TaskStateAssignment.getInputMapping(TaskStateAssignment.java:306) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.checkpoint.StateAssignmentOperation.reDistributeInputChannelStates(StateAssignmentOperation.java:409) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.checkpoint.StateAssignmentOperation.assignAttemptState(StateAssignmentOperation.java:193) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.checkpoint.StateAssignmentOperation.assignStates(StateAssignmentOperation.java:139) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.checkpoint.CheckpointCoordinator.restoreLatestCheckpointedStateInternal(CheckpointCoordinator.java:1566) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.checkpoint.CheckpointCoordinator.restoreSavepoint(CheckpointCoordinator.java:1646) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.scheduler.DefaultExecutionGraphFactory.tryRestoreExecutionGraphFromSavepoint(DefaultExecutionGraphFactory.java:163) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.scheduler.DefaultExecutionGraphFactory.createAndRestoreExecutionGraph(DefaultExecutionGraphFactory.java:138) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.scheduler.adaptive.AdaptiveScheduler.createExecutionGraphAndRestoreState(AdaptiveScheduler.java:986) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.scheduler.adaptive.AdaptiveScheduler.lambda$createExecutionGraphAndRestoreStateAsync$25(AdaptiveScheduler.java:976) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> org.apache.flink.runtime.scheduler.adaptive.BackgroundTask.lambda$new$0(BackgroundTask.java:57) >> ~[flink-dist_2.12-1.13.0.jar:1.13.0] >> at >> java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:642) >> ~[?:?] >> at >> java.util.concurrent.CompletableFuture$Completion.run(CompletableFuture.java:478) >> ~[?:?] >> at >> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) >> ~[?:?] >> at java.util.concurrent.FutureTask.run(FutureTask.java:264) ~[?:?] >> at >> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) >> ~[?:?] >> at >> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) >> ~[?:?] >> at >> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) >> ~[?:?] >> at java.lang.Thread.run(Thread.java:834) ~[?:?] >> 2021-05-14 01:55:37,916 INFO >> org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Shutting >> StandaloneApplicationClusterEntryPoint down with application status FAILED. >> Diagnostics null. >> 2021-05-14 01:55:37,917 INFO >> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint [] - Shutting >> down rest endpoint. >> >> >> -- >> ChangZhuo Chen (陳昌倬) czchen@{czchen,debian}.org >> http://czchen.info/ >> Key fingerprint = BA04 346D C2E1 FE63 C790 8793 CC65 B0CD EC27 5D5B >> >