Lewin Ma created ZEPPELIN-4332:
----------------------------------
Summary: Linkage error of bare-metal zeppelin to flink on k8s
Key: ZEPPELIN-4332
URL: https://issues.apache.org/jira/browse/ZEPPELIN-4332
Project: Zeppelin
Issue Type: Bug
Components: flink
Affects Versions: 0.8.1
Reporter: Lewin Ma
I deployed a Flink cluster on Kubernetes, then started a Zeppelin daemon and
configured a Flink interpreter. Running a job fails, and the following is logged:
```text
text: org.apache.flink.api.scala.DataSet[String] = org.apache.flink.api.scala.DataSet@9c8f964
counts: org.apache.flink.api.scala.AggregateDataSet[(String, Int)] = org.apache.flink.api.scala.AggregateDataSet@41d66e72
org.apache.flink.client.program.ProgramInvocationException: The program execution failed: Communication with JobManager failed: Lost connection to the JobManager.
  at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:409)
  at org.apache.flink.client.program.StandaloneClusterClient.submitJob(StandaloneClusterClient.java:95)
  at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:382)
  at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:369)
  at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:344)
  at org.apache.flink.client.RemoteExecutor.executePlanWithJars(RemoteExecutor.java:211)
  at org.apache.flink.client.RemoteExecutor.executePlan(RemoteExecutor.java:188)
  at org.apache.flink.api.java.RemoteEnvironment.execute(RemoteEnvironment.java:172)
  at org.apache.flink.api.java.ExecutionEnvironment.execute(ExecutionEnvironment.java:896)
  at org.apache.flink.api.scala.ExecutionEnvironment.execute(ExecutionEnvironment.scala:637)
  at org.apache.flink.api.scala.DataSet.collect(DataSet.scala:547)
  ... 36 elided
Caused by: org.apache.flink.runtime.client.JobExecutionException: Communication with JobManager failed: Lost connection to the JobManager.
  at org.apache.flink.runtime.client.JobClient.submitJobAndWait(JobClient.java:137)
  at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:405)
  ... 46 more
Caused by: org.apache.flink.runtime.client.JobClientActorConnectionTimeoutException: Lost connection to the JobManager.
  at org.apache.flink.runtime.client.JobClientActor.handleMessage(JobClientActor.java:252)
  at org.apache.flink.runtime.akka.FlinkUntypedActor.handleLeaderSessionID(FlinkUntypedActor.java:90)
  at org.apache.flink.runtime.akka.FlinkUntypedActor.onReceive(FlinkUntypedActor.java:70)
  at akka.actor.UntypedActor$$anonfun$receive$1.applyOrElse(UntypedActor.scala:167)
  at akka.actor.Actor$class.aroundReceive(Actor.scala:465)
  at akka.actor.UntypedActor.aroundReceive(UntypedActor.scala:97)
  at akka.actor.ActorCell.receiveMessage(ActorCell.scala:516)
  at akka.actor.ActorCell.invoke(ActorCell.scala:487)
  at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:254)
  at akka.dispatch.Mailbox.run(Mailbox.scala:221)
  at akka.dispatch.Mailbox.exec(Mailbox.scala:231)
  at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
  at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
  at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
  at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
  at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
```
I can successfully telnet from the Zeppelin node to the Kubernetes Flink
JobManager service port.
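
For anyone reproducing this, the telnet check amounts to opening a plain TCP
connection to the JobManager service. Below is a minimal Python sketch of that
check. The function name `can_connect` and the host name in the usage note are
hypothetical, and the port shown there assumes Flink's default
`jobmanager.rpc.port` of 6123; substitute the values of your own service. Note
that a successful connect only shows the port is reachable at the TCP level. It
does not by itself show that job submission over that connection will succeed.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened.

    This mirrors what a successful `telnet host port` demonstrates:
    the port is reachable and something is accepting connections.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake;
        # the context manager closes the socket again immediately.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

A call such as `can_connect("flink-jobmanager.example.internal", 6123)` (host
name hypothetical) run on the Zeppelin node would correspond to the telnet test
described above.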
--
This message was sent by Atlassian Jira
(v8.3.2#803003)