Hi 如果我理解没错的话,是否添加 -d 会使用不同的模式启动作业(PerJob 和 Session 模式),从错误栈来看猜测是版本冲突了导致的,你有尝试过最新的 1.11 是否还有这个问题吗? Best, Congxian
bradyMk <[email protected]> 于2020年8月14日周五 下午6:52写道: > 请问大家: > 我采用如下命令提交: > flink run \ > -m yarn-cluster \ > -yn 3 \ > -ys 3 \ > -yjm 2048m \ > -ytm 2048m \ > -ynm flink_test \ > -d \ > -c net.realtime.app.FlinkTest ./hotmall-flink.jar > 就会失败,报错信息如下: > [AMRM Callback Handler Thread] ERROR > org.apache.flink.yarn.YarnResourceManager - Fatal error occurred in > ResourceManager. > java.lang.NoSuchMethodError: > > org.apache.hadoop.yarn.api.protocolrecords.AllocateRequest.newInstance(IFLjava/util/List;Ljava/util/List;Ljava/util/List;Lorg/apache/hadoop/yarn/api/records/ResourceBlacklistRequest;)Lorg/apache/hadoop/yarn/api/protocolrecords/AllocateRequest; > at > > org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl.allocate(AMRMClientImpl.java:279) > at > > org.apache.hadoop.yarn.client.api.async.impl.AMRMClientAsyncImpl$HeartbeatThread.run(AMRMClientAsyncImpl.java:273) > [AMRM Callback Handler Thread] ERROR > org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Fatal error > occurred > in the cluster entrypoint. > java.lang.NoSuchMethodError: > > org.apache.hadoop.yarn.api.protocolrecords.AllocateRequest.newInstance(IFLjava/util/List;Ljava/util/List;Ljava/util/List;Lorg/apache/hadoop/yarn/api/records/ResourceBlacklistRequest;)Lorg/apache/hadoop/yarn/api/protocolrecords/AllocateRequest; > at > > org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl.allocate(AMRMClientImpl.java:279) > at > > org.apache.hadoop.yarn.client.api.async.impl.AMRMClientAsyncImpl$HeartbeatThread.run(AMRMClientAsyncImpl.java:273) > [flink-akka.actor.default-dispatcher-2] INFO > org.apache.flink.yarn.YarnResourceManager - ResourceManager > akka.tcp://[email protected]:33650/user/resourcemanager > was > granted leadership with fencing token 00000000000000000000000000000000 > [BlobServer shutdown hook] INFO org.apache.flink.runtime.blob.BlobServer - > Stopped BLOB server at 0.0.0.0:36247 > < > http://apache-flink.147419.n8.nabble.com/file/t802/%E6%8D%95%E8%8E%B71111.png> > > 但是我在提交命令时,不加-d,就可以正常提交运行;更奇怪的是,我运行另一个任务,加了-d参数,可以正常提交。 > 我这个提交失败的任务开始是用如下命令运行的: > nohup flink run \ > -m yarn-cluster \ > -yn 3 \ > -ys 3 \ > -yjm 2048m \ > -ytm 2048m \ > -ynm flink_test \ > -c net.realtime.app.FlinkTest ./hotmall-flink.jar > /logs/flink.log 2>&1 & > > /logs/nohup.out 2>&1 & > > 在这个任务挂掉之后,再用-d的方式重启就会出现我开始说的问题,很奇怪,有大佬知道为什么么? > > > > ----- > Best Wishes > -- > Sent from: http://apache-flink.147419.n8.nabble.com/ >
