[ 
https://issues.apache.org/jira/browse/FLINK-22577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matthias updated FLINK-22577:
-----------------------------
    Labels: test-stability  (was: )

> KubernetesLeaderElectionAndRetrievalITCase is failing
> -----------------------------------------------------
>
>                 Key: FLINK-22577
>                 URL: https://issues.apache.org/jira/browse/FLINK-22577
>             Project: Flink
>          Issue Type: Bug
>          Components: Deployment / Kubernetes, Runtime / Coordination
>    Affects Versions: 1.13.0, 1.14.0, 1.12.3
>            Reporter: Matthias
>            Priority: Critical
>              Labels: test-stability
>
> {{KubernetesLeaderElectionAndRetrievalITCase}} is failing constantly. Running 
> it locally results in an {{AssertionError}}:
> {code}
> 3069 [KubernetesLeaderElector-ExecutorService-thread-1] DEBUG 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector [] - Loop 
> thread interrupted
> java.lang.InterruptedException: null
>       at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:998)
>  ~[?:1.8.0_265]
>       at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
>  ~[?:1.8.0_265]
>       at java.util.concurrent.CountDownLatch.await(CountDownLatch.java:231) 
> ~[?:1.8.0_265]
>       at 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.loop(LeaderElector.java:200)
>  ~[kubernetes-client-4.9.2.jar:?]
>       at 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.renewWithTimeout(LeaderElector.java:100)
>  ~[kubernetes-client-4.9.2.jar:?]
>       at 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.run(LeaderElector.java:71)
>  ~[kubernetes-client-4.9.2.jar:?]
>       at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  [?:1.8.0_265]
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  [?:1.8.0_265]
>       at java.lang.Thread.run(Thread.java:748) [?:1.8.0_265]
> 3078 [KubernetesLeaderElector-ExecutorService-thread-1] ERROR 
> org.apache.flink.runtime.util.FatalExitExceptionHandler [] - FATAL: Thread 
> 'KubernetesLeaderElector-ExecutorService-thread-1' produced an uncaught 
> exception. Stopping the process...
> java.lang.AssertionError: null
>       at 
> org.apache.flink.kubernetes.highavailability.KubernetesLeaderElectionDriver.writeLeaderInformation(KubernetesLeaderElectionDriver.java:130)
>  ~[classes/:?]
>       at 
> org.apache.flink.runtime.leaderelection.TestingLeaderElectionEventHandler.lambda$onRevokeLeadership$1(TestingLeaderElectionEventHandler.java:69)
>  ~[test-classes/:?]
>       at 
> org.apache.flink.runtime.leaderelection.TestingLeaderElectionEventHandler.waitForInitialization(TestingLeaderElectionEventHandler.java:93)
>  ~[test-classes/:?]
>       at 
> org.apache.flink.runtime.leaderelection.TestingLeaderElectionEventHandler.onRevokeLeadership(TestingLeaderElectionEventHandler.java:66)
>  ~[test-classes/:?]
>       at 
> org.apache.flink.kubernetes.highavailability.KubernetesLeaderElectionDriver$LeaderCallbackHandlerImpl.notLeader(KubernetesLeaderElectionDriver.java:202)
>  ~[classes/:?]
>       at 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderCallbacks.onStopLeading(LeaderCallbacks.java:38)
>  ~[kubernetes-client-4.9.2.jar:?]
>       at 
> io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.run(LeaderElector.java:72)
>  ~[kubernetes-client-4.9.2.jar:?]
>       at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  ~[?:1.8.0_265]
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  ~[?:1.8.0_265]
>       at java.lang.Thread.run(Thread.java:748) [?:1.8.0_265]
> {code}
> The failure never popped up due to FLINK-22564
> * [1.12 release 
> branch|https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=17554&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529&l=2701]
> * [1.13 release 
> branch|https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=17558&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529&l=2760]
> * 
> [master|https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=17560&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529&l=2757]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to