Hi colleagues,

I have a simple test job. When I submit it to the Flink cluster, the JobManager
(JM) seems to disconnect. It's a one-node cluster running in a VirtualBox
CentOS 7 VM. Flink starts fine and everything else looks fine. The client log
and stack trace follow.

I would appreciate any feedback.

Cheers

17/04/12 15:53:04 INFO node.Node: Connected to Node 192.168.56.101
17/04/12 15:53:04 INFO config.ConfigurationProvider: Opened bucket default
17/04/12 15:53:04 INFO config.ConfigurationProvider: Closed bucket default
17/04/12 15:53:04 INFO node.Node: Disconnected from Node 192.168.56.101
17/04/12 15:53:07 INFO kafka.FlinkKafkaConsumerBase: Trying to get topic metadata from broker localhost:9092 in try 0/3
17/04/12 15:53:07 INFO kafka.FlinkKafkaConsumerBase: Consumer is going to read the following topics (with number of partitions): abc_pharma_qa (2),
17/04/12 15:53:07 INFO environment.RemoteStreamEnvironment: Running remotely at 192.168.56.101:6123
17/04/12 15:53:07 INFO program.StandaloneClusterClient: Submitting job with JobID: c9c717d6a6d0d5ce9a8758b0fb7dae7c. Waiting for job completion.
Submitting job with JobID: c9c717d6a6d0d5ce9a8758b0fb7dae7c. Waiting for job completion.
17/04/12 15:53:07 INFO program.StandaloneClusterClient: Starting client actor system.
17/04/12 15:53:08 INFO slf4j.Slf4jLogger: Slf4jLogger started
17/04/12 15:53:08 INFO Remoting: Starting remoting
17/04/12 15:53:08 INFO Remoting: Remoting started; listening on addresses :[akka.tcp://flink@127.0.0.1:32776]
17/04/12 15:53:08 INFO client.JobClientActor: Received job test (c9c717d6a6d0d5ce9a8758b0fb7dae7c).
17/04/12 15:53:08 INFO client.JobClientActor: Could not submit job test (c9c717d6a6d0d5ce9a8758b0fb7dae7c), because there is no connection to a JobManager.
17/04/12 15:53:08 INFO client.JobClientActor: Disconnect from JobManager null.
17/04/12 15:53:08 WARN remote.ReliableDeliverySupervisor: Association with remote system [akka.tcp://flink@192.168.56.101:6123] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
17/04/12 15:54:08 INFO client.JobClientActor: Terminate JobClientActor.
17/04/12 15:54:08 INFO client.JobClientActor: Disconnect from JobManager null.
17/04/12 15:54:08 INFO remote.RemoteActorRefProvider$RemotingTerminator: Shutting down remote daemon.
17/04/12 15:54:08 INFO remote.RemoteActorRefProvider$RemotingTerminator: Remote daemon shut down; proceeding with flushing remote transports.
17/04/12 15:54:08 INFO remote.RemoteActorRefProvider$RemotingTerminator: Remoting shut down.
Exception in thread "main" org.apache.flink.client.program.ProgramInvocationException: The program execution failed: Communication with JobManager failed: Lost connection to the JobManager.
    at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:413)
    at org.apache.flink.client.program.StandaloneClusterClient.submitJob(StandaloneClusterClient.java:92)
    at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:389)
    at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:381)
    at org.apache.flink.streaming.api.environment.RemoteStreamEnvironment.executeRemotely(RemoteStreamEnvironment.java:209)
    at org.apache.flink.streaming.api.environment.RemoteStreamEnvironment.execute(RemoteStreamEnvironment.java:173)
    at com.rfxcel.rts.operations.EventProcessorDriver.start(EventProcessorDriver.java:103)
    at com.rfxcel.rts.operations.EventProcessorDriver.main(EventProcessorDriver.java:109)
Caused by: org.apache.flink.runtime.client.JobExecutionException: Communication with JobManager failed: Lost connection to the JobManager.
    at org.apache.flink.runtime.client.JobClient.submitJobAndWait(JobClient.java:137)
    at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:409)
    ... 7 more
Caused by: org.apache.flink.runtime.client.JobClientActorConnectionTimeoutException: Lost connection to the JobManager.
    at org.apache.flink.runtime.client.JobClientActor.handleMessage(JobClientActor.java:245)
    at org.apache.flink.runtime.akka.FlinkUntypedActor.handleLeaderSessionID(FlinkUntypedActor.java:90)
    at org.apache.flink.runtime.akka.FlinkUntypedActor.onReceive(FlinkUntypedActor.java:70)
    at akka.actor.UntypedActor$$anonfun$receive$1.applyOrElse(UntypedActor.scala:167)
    at akka.actor.Actor$class.aroundReceive(Actor.scala:465)
    at akka.actor.UntypedActor.aroundReceive(UntypedActor.scala:97)
    at akka.actor.ActorCell.receiveMessage(ActorCell.scala:516)
    at akka.actor.ActorCell.invoke(ActorCell.scala:487)
    at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:254)
    at akka.dispatch.Mailbox.run(Mailbox.scala:221)
    at akka.dispatch.Mailbox.exec(Mailbox.scala:231)
    at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
    at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
    at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
    at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
    at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)