[ 
https://issues.apache.org/jira/browse/FLINK-7066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Metzger updated FLINK-7066:
----------------------------------
    Component/s:     (was: Tests)
                 Connectors / Kafka

> Kafka integration tests failing in "airplane mode"
> --------------------------------------------------
>
>                 Key: FLINK-7066
>                 URL: https://issues.apache.org/jira/browse/FLINK-7066
>             Project: Flink
>          Issue Type: Bug
>          Components: Connectors / Kafka
>            Reporter: Piotr Nowojski
>            Assignee: Piotr Nowojski
>            Priority: Major
>             Fix For: 1.4.0
>
>
> Tests KafkaXXXProducerITCase fail on my laptop in airplane mode. The cause 
> seems to be that some service listens on the wrong interface while the 
> client tries to connect to a different host. Strangely, the tests for 
> Kafka010 and Kafka011 fail with different errors, but the same fix works 
> for both (maybe in Kafka010 the original exception is masked by some other 
> error). The Kafka 0.11 tests fail like this:
> {code}
> 35309 [flink-akka.actor.default-dispatcher-3] INFO  Remoting  - Starting 
> remoting
> 42445 [flink-akka.actor.default-dispatcher-3] INFO  Remoting  - Remoting 
> started; listening on addresses 
> :[akka.tcp://flink@fe80:0:0:0:165d:140b:f597:e019%13:54398]
> 42445 [main] INFO  org.apache.flink.runtime.client.JobClient  - Started 
> JobClient actor system at [fe80::165d:140b:f597:e019]:54398
> 42450 [flink-akka.actor.default-dispatcher-5] INFO  
> org.apache.flink.runtime.client.JobSubmissionClientActor  - Disconnect from 
> JobManager null.
> 42461 [flink-akka.actor.default-dispatcher-5] INFO  
> org.apache.flink.runtime.client.JobSubmissionClientActor  - Received 
> SubmitJobAndWait(JobGraph(jobId: 3b11234d116ab1ed3c1279dd73dfaab5)) but there 
> is no connection to a JobManager yet.
> 42462 [flink-akka.actor.default-dispatcher-5] INFO  
> org.apache.flink.runtime.client.JobSubmissionClientActor  - Received job 
> Exactly once test (3b11234d116ab1ed3c1279dd73dfaab5).
> 52473 [flink-akka.actor.default-dispatcher-5] INFO  
> org.apache.flink.runtime.client.JobSubmissionClientActor  - Terminate 
> JobClientActor.
> 52473 [flink-akka.actor.default-dispatcher-5] INFO  
> org.apache.flink.runtime.client.JobSubmissionClientActor  - Disconnect from 
> JobManager null.
> org.apache.flink.runtime.client.JobExecutionException: Couldn't retrieve the 
> JobExecutionResult from the JobManager.
>       at 
> org.apache.flink.runtime.client.JobClient.awaitJobResult(JobClient.java:309)
> ...
> Caused by: 
> org.apache.flink.runtime.client.JobClientActorConnectionTimeoutException: 
> Lost connection to the JobManager.
>       at 
> org.apache.flink.runtime.client.JobClientActor.handleMessage(JobClientActor.java:219)
> ...
> {code}
> I think the issue is that something is listening on 
> fe80:0:0:0:165d:140b:f597:e019 (note that this is an IPv6 address from a 
> virtual utun0 interface on my machine), while JobClient tries to connect to 
> "localhost", which fails. When I enable wifi and connect to any network, the 
> log looks like this:
> {code}
> 32981 [flink-akka.actor.default-dispatcher-2] INFO  Remoting  - Starting 
> remoting
> 32995 [flink-akka.actor.default-dispatcher-3] INFO  Remoting  - Remoting 
> started; listening on addresses :[akka.tcp://flink@192.168.178.125:55576]
> address = akka.tcp://flink@192.168.178.125:55576
> 33000 [main] INFO  org.apache.flink.runtime.client.JobClient  - Started 
> JobClient actor system at 192.168.178.125:55576
> 33005 [flink-akka.actor.default-dispatcher-2] INFO  
> org.apache.flink.runtime.client.JobSubmissionClientActor  - Disconnect from 
> JobManager null.
> submitJobAndWait config = {restart-strategy.fixed-delay.delay=0 s, 
> local.number-taskmanager=1, taskmanager.network.netty.client.numThreads=1, 
> metrics.reporter.my_reporter.class=org.apache.flink.metrics.jmx.JMXReporter, 
> jobmanager.rpc.address=localhost, taskmanager.numberOfTaskSlots=8, 
> taskmanager.memory.size=16, metrics.reporters=my_reporter, 
> taskmanager.network.netty.server.numThreads=2, jobmanager.rpc.port=55566, 
> query.server.enable=false}
> 33013 [flink-akka.actor.default-dispatcher-2] INFO  
> org.apache.flink.runtime.client.JobSubmissionClientActor  - Received 
> SubmitJobAndWait(JobGraph(jobId: ac67638ac85a2179a37486d507a1a008)) but there 
> is no connection to a JobManager yet.
> 33014 [flink-akka.actor.default-dispatcher-2] INFO  
> org.apache.flink.runtime.client.JobSubmissionClientActor  - Received job 
> Exactly once test (ac67638ac85a2179a37486d507a1a008).
> 33024 [flink-akka.actor.default-dispatcher-2] INFO  
> org.apache.flink.runtime.client.JobSubmissionClientActor  - Connect to 
> JobManager 
> Actor[akka.tcp://flink@localhost:55566/user/jobmanager#-1394172571].
> {code}
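
To check the suspicion above on another machine, a small diagnostic along the following lines might help. It is only a sketch and is not part of Flink or of the fix for this issue: the class name ListenAddressCheck and its methods are hypothetical, and it relies only on java.net.InetAddress and java.net.NetworkInterface from the JDK. It prints what "localhost" resolves to and classifies every address on every local interface, so a link-local IPv6 address such as the one from utun0 stands out.

```java
import java.net.InetAddress;
import java.net.NetworkInterface;
import java.net.SocketException;
import java.net.UnknownHostException;
import java.util.Collections;
import java.util.Enumeration;

// Hypothetical helper for illustration only; not a Flink class.
final class ListenAddressCheck {

    // Parses an IP literal. Literals are parsed directly, so no DNS lookup happens here.
    static InetAddress parse(String ipLiteral) {
        try {
            return InetAddress.getByName(ipLiteral);
        } catch (UnknownHostException e) {
            throw new IllegalArgumentException("Not a valid IP literal: " + ipLiteral, e);
        }
    }

    // Returns the scope of the address: loopback, link-local, site-local, or other.
    static String classify(String ipLiteral) {
        InetAddress addr = parse(ipLiteral);
        if (addr.isLoopbackAddress()) {
            return "loopback";
        }
        if (addr.isLinkLocalAddress()) {
            return "link-local";
        }
        if (addr.isSiteLocalAddress()) {
            return "site-local";
        }
        return "other";
    }

    // True if the address is a loopback address or the wildcard address (0.0.0.0 or ::).
    static boolean isLoopbackOrWildcard(String ipLiteral) {
        InetAddress addr = parse(ipLiteral);
        return addr.isLoopbackAddress() || addr.isAnyLocalAddress();
    }

    public static void main(String[] args) throws SocketException, UnknownHostException {
        // What the client side would use when it is configured with "localhost".
        System.out.println("localhost -> " + InetAddress.getByName("localhost").getHostAddress());

        Enumeration<NetworkInterface> nics = NetworkInterface.getNetworkInterfaces();
        if (nics == null) {
            System.out.println("No network interfaces found");
            return;
        }
        // List every address per interface together with its scope.
        for (NetworkInterface nic : Collections.list(nics)) {
            for (InetAddress addr : Collections.list(nic.getInetAddresses())) {
                String scope = addr.isLoopbackAddress() ? "loopback"
                        : addr.isLinkLocalAddress() ? "link-local"
                        : addr.isSiteLocalAddress() ? "site-local"
                        : "other";
                System.out.println(nic.getName() + " " + addr.getHostAddress() + " " + scope);
            }
        }
    }
}
```

With the addresses from the two logs, classify("fe80::165d:140b:f597:e019") returns "link-local" and classify("192.168.178.125") returns "site-local"; neither is a loopback address, so isLoopbackOrWildcard returns false for both. The output of main depends on the machine and on whether it is in airplane mode.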



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
