[
https://issues.apache.org/jira/browse/FLINK-22500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17334574#comment-17334574
]
Yang Wang commented on FLINK-22500:
-----------------------------------
Could you share the JobManager logs so that we could verify that the job is
running with ZERO job id?
> flink stop 命令找不到00000000000000000000000000000000
> ------------------------------------------------
>
> Key: FLINK-22500
> URL: https://issues.apache.org/jira/browse/FLINK-22500
> Project: Flink
> Issue Type: Bug
> Components: Deployment / Kubernetes
> Reporter: 陈孝忠
> Priority: Major
>
> flin 1.12.1.版本, k8s applicton mode 使用K8S 做HA
> stop 命令和 run 都会报不到00000000000000000000000000000000
> 下面是 STOP 的日志
> 开始提交任务:2021-04-28 06:18:37开始提交任务:2021-04-28 06:18:37启动命令:/opt/flink/bin/flink
> stop 00000000000000000000000000000000 --target kubernetes-application
> -Dkubernetes.namespace=middle-flink
> -Dkubernetes.config.file=/opt/flink/conf/kubeconfig
> -Dkubernetes.cluster-id=test0001Suspending job
> "00000000000000000000000000000000" with a savepoint.
> 2021-04-28 06:18:39,574 INFO
> org.apache.flink.kubernetes.KubernetesClusterDescriptor [] - Retrieve
> flink cluster market-data-full-chain-0427 successfully, JobManager Web
> Interface: http://market-data-full-chain-0427-rest.middle-flink:10243
> rs=1
> ------------------------------------------------------------ The program
> finished with the following exception:
> org.apache.flink.util.FlinkException: Could not stop with a savepoint job
> "00000000000000000000000000000000". at
> org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:585)
> at
> org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1006)
> at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:573) at
> org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1073) at
> org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1136)
> at
> org.apache.flink.runtime.security.contexts.NoOpSecurityContext.runSecured(NoOpSecurityContext.java:28)
> at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1136)Caused
> by: java.util.concurrent.ExecutionException:
> java.util.concurrent.CompletionException:
> org.apache.flink.runtime.messages.FlinkJobNotFoundException: Could not find
> Flink job (00000000000000000000000000000000) at
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
> at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1928) at
> org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:583)
> ... 6 moreCaused by: java.util.concurrent.CompletionException:
> org.apache.flink.runtime.messages.FlinkJobNotFoundException: Could not find
> Flink job (00000000000000000000000000000000) at
> java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:292)
> at
> java.util.concurrent.CompletableFuture.uniComposeStage(CompletableFuture.java:989)
> at
> java.util.concurrent.CompletableFuture.thenCompose(CompletableFuture.java:2137)
> at
> org.apache.flink.runtime.dispatcher.Dispatcher.performOperationOnJobMasterGateway(Dispatcher.java:910)
> at
> org.apache.flink.runtime.dispatcher.Dispatcher.stopWithSavepoint(Dispatcher.java:709)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498) at
> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcInvocation(AkkaRpcActor.java:306)
> at
> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:213)
> at
> org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:77)
> at
> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(AkkaRpcActor.java:159)
> at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26) at
> akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21) at
> scala.PartialFunction.applyOrElse(PartialFunction.scala:123) at
> scala.PartialFunction.applyOrElse$(PartialFunction.scala:122) at
> akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21) at
> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171) at
> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172) at
> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172) at
> akka.actor.Actor.aroundReceive(Actor.scala:517) at
> akka.actor.Actor.aroundReceive$(Actor.scala:515) at
> akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225) at
> akka.actor.ActorCell.receiveMessage(ActorCell.scala:592) at
> akka.actor.ActorCell.invoke(ActorCell.scala:561) at
> akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258) at
> akka.dispatch.Mailbox.run(Mailbox.scala:225) at
> akka.dispatch.Mailbox.exec(Mailbox.scala:235) at
> akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) at
> akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) at
> akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)Caused
> by: org.apache.flink.runtime.messages.FlinkJobNotFoundException: Could not
> find Flink job (00000000000000000000000000000000) at
> org.apache.flink.runtime.dispatcher.Dispatcher.getJobMasterGateway(Dispatcher.java:897)
> ... 30 more
> pcs.waitFor() 执行异常 rs=1java.lang.Exception: pcs.waitFor() is error
> rs=1启动结束时间: 2021-04-28 06:19:17
> ######启动结果是 失败##############################
--
This message was sent by Atlassian Jira
(v8.3.4#803005)