Till Rohrmann commented on FLINK-7352:

I think [~StephanEwen] is right and the problem is 
 You can simulate it by removing the sleep and introducing a small sleep in 

I think the solution would be to wait on the {{SimpleAckingTaskManagerGateway}} 
until it has received all task submissions before switching the {{Executions}} 
to running.

> ExecutionGraphRestartTest timeouts
> ----------------------------------
>                 Key: FLINK-7352
>                 URL: https://issues.apache.org/jira/browse/FLINK-7352
>             Project: Flink
>          Issue Type: Bug
>          Components: Distributed Coordination, Tests
>    Affects Versions: 1.4.0, 1.3.2
>            Reporter: Nico Kruber
>            Priority: Critical
>              Labels: test-stability
> Recently, I received timeouts from some tests in 
> {{ExecutionGraphRestartTest}} like this
> {code}
> Tests in error: 
>   ExecutionGraphRestartTest.testConcurrentLocalFailAndRestart:638 ยป Timeout
> {code}
> This particular instance is from 1.3.2 RC2 and stuck in 
> {{ExecutionGraphTestUtils#waitUntilDeployedAndSwitchToRunning()}} but I also 
> had instances stuck in {{ExecutionGraphTestUtils#waitUntilJobStatus}}.

This message was sent by Atlassian JIRA

Reply via email to