[
https://issues.apache.org/jira/browse/FLINK-22577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Till Rohrmann closed FLINK-22577.
---------------------------------
Fix Version/s: 1.12.4
1.13.1
1.14.0
Resolution: Fixed
Fixed via
1.14.0: fbf84acf63102db455c89cb8e497cda423a1c4d5
1.13.1: 3ff9eb7029784349fb135e6849b745ba82c7b8c0
1.12.4: ed9965c33853ab95e0a3264b772f82fd8404239a
> KubernetesLeaderElectionAndRetrievalITCase is failing
> -----------------------------------------------------
>
> Key: FLINK-22577
> URL: https://issues.apache.org/jira/browse/FLINK-22577
> Project: Flink
> Issue Type: Bug
> Components: Deployment / Kubernetes, Runtime / Coordination
> Affects Versions: 1.13.0, 1.14.0, 1.12.3
> Reporter: Matthias
> Assignee: Till Rohrmann
> Priority: Critical
> Labels: pull-request-available, test-stability
> Fix For: 1.14.0, 1.13.1, 1.12.4
>
>
> {{KubernetesLeaderElectionAndRetrievalITCase}} is failing constantly. Running
> it locally results in an {{AssertionError}}:
> {code}
> 3069 [KubernetesLeaderElector-ExecutorService-thread-1] DEBUG
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector [] - Loop
> thread interrupted
> java.lang.InterruptedException: null
> at
> java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:998)
> ~[?:1.8.0_265]
> at
> java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
> ~[?:1.8.0_265]
> at java.util.concurrent.CountDownLatch.await(CountDownLatch.java:231)
> ~[?:1.8.0_265]
> at
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.loop(LeaderElector.java:200)
> ~[kubernetes-client-4.9.2.jar:?]
> at
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.renewWithTimeout(LeaderElector.java:100)
> ~[kubernetes-client-4.9.2.jar:?]
> at
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.run(LeaderElector.java:71)
> ~[kubernetes-client-4.9.2.jar:?]
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> [?:1.8.0_265]
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> [?:1.8.0_265]
> at java.lang.Thread.run(Thread.java:748) [?:1.8.0_265]
> 3078 [KubernetesLeaderElector-ExecutorService-thread-1] ERROR
> org.apache.flink.runtime.util.FatalExitExceptionHandler [] - FATAL: Thread
> 'KubernetesLeaderElector-ExecutorService-thread-1' produced an uncaught
> exception. Stopping the process...
> java.lang.AssertionError: null
> at
> org.apache.flink.kubernetes.highavailability.KubernetesLeaderElectionDriver.writeLeaderInformation(KubernetesLeaderElectionDriver.java:130)
> ~[classes/:?]
> at
> org.apache.flink.runtime.leaderelection.TestingLeaderElectionEventHandler.lambda$onRevokeLeadership$1(TestingLeaderElectionEventHandler.java:69)
> ~[test-classes/:?]
> at
> org.apache.flink.runtime.leaderelection.TestingLeaderElectionEventHandler.waitForInitialization(TestingLeaderElectionEventHandler.java:93)
> ~[test-classes/:?]
> at
> org.apache.flink.runtime.leaderelection.TestingLeaderElectionEventHandler.onRevokeLeadership(TestingLeaderElectionEventHandler.java:66)
> ~[test-classes/:?]
> at
> org.apache.flink.kubernetes.highavailability.KubernetesLeaderElectionDriver$LeaderCallbackHandlerImpl.notLeader(KubernetesLeaderElectionDriver.java:202)
> ~[classes/:?]
> at
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderCallbacks.onStopLeading(LeaderCallbacks.java:38)
> ~[kubernetes-client-4.9.2.jar:?]
> at
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.run(LeaderElector.java:72)
> ~[kubernetes-client-4.9.2.jar:?]
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> ~[?:1.8.0_265]
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> ~[?:1.8.0_265]
> at java.lang.Thread.run(Thread.java:748) [?:1.8.0_265]
> {code}
> The failure never popped up due to FLINK-22564
> * [1.12 release
> branch|https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=17554&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529&l=2701]
> * [1.13 release
> branch|https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=17558&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529&l=2760]
> *
> [master|https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=17560&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529&l=2757]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)