[ 
https://issues.apache.org/jira/browse/FLINK-22577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Till Rohrmann closed FLINK-22577.
---------------------------------
    Fix Version/s: 1.12.4
                   1.13.1
                   1.14.0
       Resolution: Fixed

Fixed via

1.14.0: fbf84acf63102db455c89cb8e497cda423a1c4d5
1.13.1: 3ff9eb7029784349fb135e6849b745ba82c7b8c0
1.12.4: ed9965c33853ab95e0a3264b772f82fd8404239a

> KubernetesLeaderElectionAndRetrievalITCase is failing
> -----------------------------------------------------
>
>                 Key: FLINK-22577
>                 URL: https://issues.apache.org/jira/browse/FLINK-22577
>             Project: Flink
>          Issue Type: Bug
>          Components: Deployment / Kubernetes, Runtime / Coordination
>    Affects Versions: 1.13.0, 1.14.0, 1.12.3
>            Reporter: Matthias
>            Assignee: Till Rohrmann
>            Priority: Critical
>              Labels: pull-request-available, test-stability
>             Fix For: 1.14.0, 1.13.1, 1.12.4
>
>
> {{KubernetesLeaderElectionAndRetrievalITCase}} is failing constantly. Running 
> it locally results in an {{AssertionError}}:
> {code}
> 3069 [KubernetesLeaderElector-ExecutorService-thread-1] DEBUG 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector [] - Loop 
> thread interrupted
> java.lang.InterruptedException: null
>       at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:998)
>  ~[?:1.8.0_265]
>       at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
>  ~[?:1.8.0_265]
>       at java.util.concurrent.CountDownLatch.await(CountDownLatch.java:231) 
> ~[?:1.8.0_265]
>       at 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.loop(LeaderElector.java:200)
>  ~[kubernetes-client-4.9.2.jar:?]
>       at 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.renewWithTimeout(LeaderElector.java:100)
>  ~[kubernetes-client-4.9.2.jar:?]
>       at 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.run(LeaderElector.java:71)
>  ~[kubernetes-client-4.9.2.jar:?]
>       at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  [?:1.8.0_265]
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  [?:1.8.0_265]
>       at java.lang.Thread.run(Thread.java:748) [?:1.8.0_265]
> 3078 [KubernetesLeaderElector-ExecutorService-thread-1] ERROR 
> org.apache.flink.runtime.util.FatalExitExceptionHandler [] - FATAL: Thread 
> 'KubernetesLeaderElector-ExecutorService-thread-1' produced an uncaught 
> exception. Stopping the process...
> java.lang.AssertionError: null
>       at 
> org.apache.flink.kubernetes.highavailability.KubernetesLeaderElectionDriver.writeLeaderInformation(KubernetesLeaderElectionDriver.java:130)
>  ~[classes/:?]
>       at 
> org.apache.flink.runtime.leaderelection.TestingLeaderElectionEventHandler.lambda$onRevokeLeadership$1(TestingLeaderElectionEventHandler.java:69)
>  ~[test-classes/:?]
>       at 
> org.apache.flink.runtime.leaderelection.TestingLeaderElectionEventHandler.waitForInitialization(TestingLeaderElectionEventHandler.java:93)
>  ~[test-classes/:?]
>       at 
> org.apache.flink.runtime.leaderelection.TestingLeaderElectionEventHandler.onRevokeLeadership(TestingLeaderElectionEventHandler.java:66)
>  ~[test-classes/:?]
>       at 
> org.apache.flink.kubernetes.highavailability.KubernetesLeaderElectionDriver$LeaderCallbackHandlerImpl.notLeader(KubernetesLeaderElectionDriver.java:202)
>  ~[classes/:?]
>       at 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderCallbacks.onStopLeading(LeaderCallbacks.java:38)
>  ~[kubernetes-client-4.9.2.jar:?]
>       at 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.run(LeaderElector.java:72)
>  ~[kubernetes-client-4.9.2.jar:?]
>       at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  ~[?:1.8.0_265]
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  ~[?:1.8.0_265]
>       at java.lang.Thread.run(Thread.java:748) [?:1.8.0_265]
> {code}
> The failure never popped up due to FLINK-22564
> * [1.12 release 
> branch|https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=17554&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529&l=2701]
> * [1.13 release 
> branch|https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=17558&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529&l=2760]
> * 
> [master|https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=17560&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529&l=2757]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to