[ https://issues.apache.org/jira/browse/FLINK-9097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16418711#comment-16418711 ]
ASF GitHub Bot commented on FLINK-9097: --------------------------------------- Github user tillrohrmann commented on the issue: https://github.com/apache/flink/pull/5774 I had to put another commit on top of it to fix a problem with the failing `DispatcherTest#testWaitingForJobMasterLeadership` @GJL. The new commit makes sure that we first recover all jobs before we set the fencing token of the `Dispatcher`. That way, no other action can interfere with the job recover, e.g. another job submission. > Jobs can be dropped in HA when job submission fails > --------------------------------------------------- > > Key: FLINK-9097 > URL: https://issues.apache.org/jira/browse/FLINK-9097 > Project: Flink > Issue Type: Bug > Components: Distributed Coordination > Affects Versions: 1.5.0 > Reporter: Till Rohrmann > Assignee: Till Rohrmann > Priority: Blocker > Labels: flip-6 > Fix For: 1.5.0 > > > Jobs can be dropped in HA mode if the job submission step fails. In such a > case, we should fail fatally to let the {{Dispatcher}} restart and retry to > recover all jobs. -- This message was sent by Atlassian JIRA (v7.6.3#76005)