zentol opened a new pull request #10887: [FLINK-15150][tests] Prevent job from 
reaching terminal state
URL: https://github.com/apache/flink/pull/10887
 
 
   Fixes an instability in the `ZookeeperLeaderElectionITCase` where the 
shutdown of the Dispatcher caused a slot allocation to fail, resulting in the 
job failing, reaching a terminal state and afterwards being removed from 
Zookeeper.
   
   We now prevent the job from reaching a terminal state by enabling a 
fixed-delay restart strategy. Should the allocation fail the JM will retry 
until the JM itself is being shut down. On shutdown the JM will suspend the 
job, allowing it to be recovered by other Dispatchers.
   
   The exact behavior for what happens to running jobs when the Dispatcher is 
shut down in an orderly fashion is currently undefined, and this PR makes no 
attempt remedy this.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to