您好,我的版本是1.13.1
 
 
------------------ Original ------------------
From: &nbsp;"Yang Wang"<danrtsey...@gmail.com&gt;;
Date: &nbsp;Tue, Aug 17, 2021 03:25 PM
To: &nbsp;"user-zh"<user-zh@flink.apache.org&gt;; 

Subject: &nbsp;Re: Flink On Yarn HA 部署模式下Flink程序无法启动

&nbsp;

看报错应该是个已知问题[1]并且已经在1.11.2中修复

[1]. https://issues.apache.org/jira/browse/FLINK-19212

Best,
Yang

周瑞 <rui.z...@woqutech.com&gt; 于2021年8月17日周二 上午11:04写道:

&gt; 您好:Flink程序部署在Yran上以Appliation Mode 模式启动的,在没有采用HA
&gt; 模式的时候可以正常启动,配置了HA之后,启动异常,麻烦帮忙看下是什么原因导致的.
&gt;
&gt;
&gt; HA 配置如下:
&gt; high-availability: zookeeper high-availability.storageDir:
&gt; hdfs://mycluster/flink/ha high-availability.zookeeper.quorum:
&gt; zk-1:2181,zk-2:2181,zk-3:2181 high-availability.zookeeper.path.root: /flink
&gt; high-availability.cluster-id: /flink_cluster
&gt;
&gt;
&gt; 异常如下:
&gt; 2021-08-17 10:24:18,938 INFO
&gt; org.apache.flink.runtime.leaderelection.DefaultLeaderElectionService [] -
&gt; Starting DefaultLeaderElectionService with
&gt; ZooKeeperLeaderElectionDriver{leaderPath='/leader/resource_manager_lock'}.
&gt; 2021-08-17 10:25:09,706 ERROR
&gt; org.apache.flink.runtime.rest.handler.taskmanager.TaskManagerDetailsHandler
&gt; [] - Unhandled exception.
&gt; org.apache.flink.runtime.rpc.akka.exceptions.AkkaRpcException: Failed to
&gt; serialize the result for RPC call : requestTaskManagerDetailsInfo.
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.serializeRemoteResultAndVerifySize(AkkaRpcActor.java:404)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.lambda$sendAsyncResponse$0(AkkaRpcActor.java:360)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
java.util.concurrent.CompletableFuture.uniHandle(CompletableFuture.java:836)
&gt; ~[?:1.8.0_292]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
java.util.concurrent.CompletableFuture.uniHandleStage(CompletableFuture.java:848)
&gt; ~[?:1.8.0_292]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; java.util.concurrent.CompletableFuture.handle(CompletableFuture.java:2168)
&gt; ~[?:1.8.0_292]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.sendAsyncResponse(AkkaRpcActor.java:352)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcInvocation(AkkaRpcActor.java:319)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:212)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:77)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(AkkaRpcActor.java:158)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at 
akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at 
akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; scala.PartialFunction.applyOrElse(PartialFunction.scala:123)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; scala.PartialFunction.applyOrElse$(PartialFunction.scala:122)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at 
akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at 
akka.actor.Actor.aroundReceive(Actor.scala:517)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; akka.actor.Actor.aroundReceive$(Actor.scala:515)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; akka.actor.ActorCell.receiveMessage(ActorCell.scala:592)
&gt; [flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; akka.actor.ActorCell.invoke(ActorCell.scala:561)
&gt; [flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258)
&gt; [flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at 
akka.dispatch.Mailbox.run(Mailbox.scala:225)
&gt; [flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at 
akka.dispatch.Mailbox.exec(Mailbox.scala:235)
&gt; [flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
&gt; [flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
&gt; [flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
&gt; [flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
&gt; [flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; Caused by: java.io.NotSerializableException:
&gt; org.apache.flink.runtime.resourcemanager.TaskManagerInfoWithSlots
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1184)
&gt; ~[?:1.8.0_292]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; java.io.ObjectOutputStream.writeObject(ObjectOutputStream.java:348)
&gt; ~[?:1.8.0_292]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
org.apache.flink.util.InstantiationUtil.serializeObject(InstantiationUtil.java:624)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
org.apache.flink.runtime.rpc.akka.AkkaRpcSerializedValue.valueOf(AkkaRpcSerializedValue.java:66)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;at
&gt; 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.serializeRemoteResultAndVerifySize(AkkaRpcActor.java:387)
&gt; ~[flink-dist_2.12-1.13.1.jar:1.13.1]
&gt; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;... 29 more

回复