[ https://issues.apache.org/jira/browse/FLINK-5193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated FLINK-5193: ---------------------------------- Labels: pull-request-available (was: ) > Recovering all jobs fails completely if a single recovery fails > --------------------------------------------------------------- > > Key: FLINK-5193 > URL: https://issues.apache.org/jira/browse/FLINK-5193 > Project: Flink > Issue Type: Bug > Components: Runtime / Coordination > Affects Versions: 1.1.3, 1.2.0 > Reporter: Till Rohrmann > Assignee: Till Rohrmann > Priority: Major > Labels: pull-request-available > Fix For: 1.1.4, 1.2.0 > > > In HA case where the {{JobManager}} tries to recover all submitted job > graphs, e.g. when regaining leadership, it can happen that none of the > submitted jobs are recovered if a single recovery fails. Instead of failing > the complete recovery procedure, the {{JobManager}} should still try to > recover the remaining (non-failing) jobs and print a proper error message for > the failed recoveries. -- This message was sent by Atlassian Jira (v8.20.10#820010)