[ 
https://issues.apache.org/jira/browse/FLINK-21329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284596#comment-17284596
 ] 

Dawid Wysakowicz commented on FLINK-21329:
------------------------------------------

Similar outcome: 
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=13326&view=logs&j=6caf31d6-847a-526e-9624-468e053467d6&t=0b23652f-b18b-5b6e-6eb6-a11070364610

{code}
2021-02-15T01:02:52.3770907Z 
------------------------------------------------------------
2021-02-15T01:02:52.3772409Z  The program finished with the following exception:
2021-02-15T01:02:52.3773214Z 
2021-02-15T01:02:52.3774604Z 
org.apache.flink.client.program.ProgramInvocationException: The main method 
caused an error: java.util.concurrent.TimeoutException
2021-02-15T01:02:52.3775820Z    at 
org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:366)
2021-02-15T01:02:52.3776889Z    at 
org.apache.flink.client.program.PackagedProgram.invokeInteractiveModeForExecution(PackagedProgram.java:219)
2021-02-15T01:02:52.3778111Z    at 
org.apache.flink.client.ClientUtils.executeProgram(ClientUtils.java:114)
2021-02-15T01:02:52.3778824Z    at 
org.apache.flink.client.cli.CliFrontend.executeProgram(CliFrontend.java:812)
2021-02-15T01:02:52.3779712Z    at 
org.apache.flink.client.cli.CliFrontend.run(CliFrontend.java:246)
2021-02-15T01:02:52.3780328Z    at 
org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1054)
2021-02-15T01:02:52.3780963Z    at 
org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
2021-02-15T01:02:52.3781663Z    at 
org.apache.flink.runtime.security.contexts.NoOpSecurityContext.runSecured(NoOpSecurityContext.java:28)
2021-02-15T01:02:52.3782340Z    at 
org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)
2021-02-15T01:02:52.3782958Z Caused by: 
java.util.concurrent.ExecutionException: java.util.concurrent.TimeoutException
2021-02-15T01:02:52.3783619Z    at 
java.base/java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:395)
2021-02-15T01:02:52.3784257Z    at 
java.base/java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1999)
2021-02-15T01:02:52.3784973Z    at 
org.apache.flink.client.program.StreamContextEnvironment.getJobExecutionResult(StreamContextEnvironment.java:123)
2021-02-15T01:02:52.3785740Z    at 
org.apache.flink.client.program.StreamContextEnvironment.execute(StreamContextEnvironment.java:80)
2021-02-15T01:02:52.3786561Z    at 
org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:1782)
2021-02-15T01:02:52.3787396Z    at 
org.apache.flink.streaming.tests.StickyAllocationAndLocalRecoveryTestJob.main(StickyAllocationAndLocalRecoveryTestJob.java:138)
2021-02-15T01:02:52.3788110Z    at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
2021-02-15T01:02:52.3788757Z    at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
2021-02-15T01:02:52.3791150Z    at 
java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
2021-02-15T01:02:52.3791827Z    at 
java.base/java.lang.reflect.Method.invoke(Method.java:566)
2021-02-15T01:02:52.3792706Z    at 
org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:349)
2021-02-15T01:02:52.3793248Z    ... 8 more
2021-02-15T01:02:52.3793651Z Caused by: java.util.concurrent.TimeoutException
2021-02-15T01:02:52.3794206Z    at 
org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1220)
2021-02-15T01:02:52.3794897Z    at 
org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:217)
2021-02-15T01:02:52.3795593Z    at 
org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:582)
2021-02-15T01:02:52.3796266Z    at 
java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
2021-02-15T01:02:52.3796880Z    at 
java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
2021-02-15T01:02:52.3797574Z    at 
java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304)
2021-02-15T01:02:52.3798805Z    at 
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
2021-02-15T01:02:52.3800632Z    at 
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
2021-02-15T01:02:52.3801242Z    at 
java.base/java.lang.Thread.run(Thread.java:834)
2021-02-15T01:54:04.9335391Z 
==========================================================================================
2021-02-15T01:54:04.9336579Z === WARNING: This E2E Run took already 80% of the 
allocated time budget of 250 minutes ===
2021-02-15T01:54:04.9338033Z 
==========================================================================================
2021-02-15T02:33:04.9292547Z 
======================================================================================================
2021-02-15T02:33:04.9293721Z === WARNING: This E2E Run will time out in the 
next few minutes. Starting to upload the log output ===
2021-02-15T02:33:04.9294789Z 
======================================================================================================
{code}

> "Local recovery and sticky scheduling end-to-end test" does not finish within 
> 600 seconds
> -----------------------------------------------------------------------------------------
>
>                 Key: FLINK-21329
>                 URL: https://issues.apache.org/jira/browse/FLINK-21329
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Coordination
>    Affects Versions: 1.13.0
>            Reporter: Robert Metzger
>            Priority: Critical
>              Labels: test-stability
>             Fix For: 1.13.0
>
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=13118&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529
> {code}
> Feb 08 22:25:46 
> ==============================================================================
> Feb 08 22:25:46 Running 'Local recovery and sticky scheduling end-to-end test'
> Feb 08 22:25:46 
> ==============================================================================
> Feb 08 22:25:46 TEST_DATA_DIR: 
> /home/vsts/work/1/s/flink-end-to-end-tests/test-scripts/temp-test-directory-46881214821
> Feb 08 22:25:47 Flink dist directory: 
> /home/vsts/work/1/s/flink-dist/target/flink-1.13-SNAPSHOT-bin/flink-1.13-SNAPSHOT
> Feb 08 22:25:47 Running local recovery test with configuration:
> Feb 08 22:25:47         parallelism: 4
> Feb 08 22:25:47         max attempts: 10
> Feb 08 22:25:47         backend: rocks
> Feb 08 22:25:47         incremental checkpoints: false
> Feb 08 22:25:47         kill JVM: false
> Feb 08 22:25:47 Starting zookeeper daemon on host fv-az127-394.
> Feb 08 22:25:47 Starting HA cluster with 1 masters.
> Feb 08 22:25:48 Starting standalonesession daemon on host fv-az127-394.
> Feb 08 22:25:49 Starting taskexecutor daemon on host fv-az127-394.
> Feb 08 22:25:49 Waiting for Dispatcher REST endpoint to come up...
> Feb 08 22:25:50 Waiting for Dispatcher REST endpoint to come up...
> Feb 08 22:25:51 Waiting for Dispatcher REST endpoint to come up...
> Feb 08 22:25:53 Waiting for Dispatcher REST endpoint to come up...
> Feb 08 22:25:54 Dispatcher REST endpoint is up.
> Feb 08 22:25:54 Started TM watchdog with PID 28961.
> Feb 08 22:25:58 Job has been submitted with JobID 
> e790e85a39040539f9386c0df7ca4812
> Feb 08 22:35:47 Test (pid: 27970) did not finish after 600 seconds.
> Feb 08 22:35:47 Printing Flink logs and killing it:
> {code}
> and
> {code}
>       at 
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalDriver.unhandledError(ZooKeeperLeaderRetrievalDriver.java:184)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl$6.apply(CuratorFrameworkImpl.java:713)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl$6.apply(CuratorFrameworkImpl.java:709)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.framework.listen.ListenerContainer$1.run(ListenerContainer.java:100)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.shaded.com.google.common.util.concurrent.DirectExecutor.execute(DirectExecutor.java:30)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.framework.listen.ListenerContainer.forEach(ListenerContainer.java:92)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl.logError(CuratorFrameworkImpl.java:708)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl.checkBackgroundRetry(CuratorFrameworkImpl.java:874)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl.performBackgroundOperation(CuratorFrameworkImpl.java:990)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl.backgroundOperationsLoop(CuratorFrameworkImpl.java:943)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl.access$300(CuratorFrameworkImpl.java:66)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl$4.call(CuratorFrameworkImpl.java:346)
>       at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>       at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
>       at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
>       at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>       at java.lang.Thread.run(Thread.java:748)
> Caused by: 
> org.apache.flink.shaded.zookeeper3.org.apache.zookeeper.KeeperException$ConnectionLossException:
>  KeeperErrorCode = ConnectionLoss
>       at 
> org.apache.flink.shaded.zookeeper3.org.apache.zookeeper.KeeperException.create(KeeperException.java:102)
>       at 
> org.apache.flink.shaded.curator4.org.apache.curator.framework.imps.CuratorFrameworkImpl.checkBackgroundRetry(CuratorFrameworkImpl.java:862)
>       ... 10 more
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to