Lewin Ma created ZEPPELIN-4332:
----------------------------------

             Summary: Linkage error of bare-metal zeppelin to flink on k8s
                 Key: ZEPPELIN-4332
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-4332
             Project: Zeppelin
          Issue Type: Bug
          Components: flink
    Affects Versions: 0.8.1
            Reporter: Lewin Ma


I have deployed a flink cluster by kubernetes. Then start a zeppelin daemon and 
configed a flink interpreter. But is logged:

```bash

text: org.apache.flink.api.scala.DataSet[String] = 
org.apache.flink.api.scala.DataSet@9c8f964 counts: 
org.apache.flink.api.scala.AggregateDataSet[(String, Int)] = 
org.apache.flink.api.scala.AggregateDataSet@41d66e72 
org.apache.flink.client.program.ProgramInvocationException: The program 
execution failed: Communication with JobManager failed: Lost connection to the 
JobManager. at 
org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:409) at 
org.apache.flink.client.program.StandaloneClusterClient.submitJob(StandaloneClusterClient.java:95)
 at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:382) 
at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:369) at 
org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:344) at 
org.apache.flink.client.RemoteExecutor.executePlanWithJars(RemoteExecutor.java:211)
 at org.apache.flink.client.RemoteExecutor.executePlan(RemoteExecutor.java:188) 
at 
org.apache.flink.api.java.RemoteEnvironment.execute(RemoteEnvironment.java:172) 
at 
org.apache.flink.api.java.ExecutionEnvironment.execute(ExecutionEnvironment.java:896)
 at 
org.apache.flink.api.scala.ExecutionEnvironment.execute(ExecutionEnvironment.scala:637)
 at org.apache.flink.api.scala.DataSet.collect(DataSet.scala:547) ... 36 elided 
Caused by: org.apache.flink.runtime.client.JobExecutionException: Communication 
with JobManager failed: Lost connection to the JobManager. at 
org.apache.flink.runtime.client.JobClient.submitJobAndWait(JobClient.java:137) 
at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:405) 
... 46 more Caused by: 
org.apache.flink.runtime.client.JobClientActorConnectionTimeoutException: Lost 
connection to the JobManager. at 
org.apache.flink.runtime.client.JobClientActor.handleMessage(JobClientActor.java:252)
 at 
org.apache.flink.runtime.akka.FlinkUntypedActor.handleLeaderSessionID(FlinkUntypedActor.java:90)
 at 
org.apache.flink.runtime.akka.FlinkUntypedActor.onReceive(FlinkUntypedActor.java:70)
 at 
akka.actor.UntypedActor$$anonfun$receive$1.applyOrElse(UntypedActor.scala:167) 
at akka.actor.Actor$class.aroundReceive(Actor.scala:465) at 
akka.actor.UntypedActor.aroundReceive(UntypedActor.scala:97) at 
akka.actor.ActorCell.receiveMessage(ActorCell.scala:516) at 
akka.actor.ActorCell.invoke(ActorCell.scala:487) at 
akka.dispatch.Mailbox.processMailbox(Mailbox.scala:254) at 
akka.dispatch.Mailbox.run(Mailbox.scala:221) at 
akka.dispatch.Mailbox.exec(Mailbox.scala:231) at 
scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) at 
scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
 at 
scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
 at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) at 
scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)

```

I can make a success telnet command from the node of zeppelin to the kubernetes 
flink jobmanager service port



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

Reply via email to