Stephan Ewen created FLINK-4808:
-----------------------------------

Summary: Allow skipping failed checkpoints
Key: FLINK-4808
URL: https://issues.apache.org/jira/browse/FLINK-4808
Project: Flink
Issue Type: New Feature
Affects Versions: 1.1.2, 1.1.3
Reporter: Stephan Ewen
Fix For: 1.2.0

Currently, if Flink cannot complete a checkpoint, the job fails and goes through recovery.

To make less stable storage infrastructure less disruptive to Flink's performance, Flink should be able to tolerate a certain number of failed checkpoints and simply keep executing.

This should be controllable via a parameter, for example:

{code}
env.getCheckpointConfig().setAllowedFailedCheckpoints(3);
{code}

A value of {{-1}} could indicate that Flink tolerates an unlimited number of checkpoint failures. The default value should still be {{0}}, to stay compatible with the existing behavior.
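
To make the proposed semantics concrete, here is a minimal, standalone sketch of how such a counter might behave. It does not touch any Flink class: {{FailedCheckpointTolerance}} and its methods are hypothetical names introduced only for illustration. It also assumes that the limit applies to consecutive failures and that a completed checkpoint resets the count. The issue text does not specify this, so treat it as one possible reading.

```java
/**
 * Hypothetical helper illustrating the proposed setting; not part of Flink.
 * ASSUMPTION: the limit counts consecutive failed checkpoints and a
 * completed checkpoint resets the counter. The issue leaves this open.
 */
final class FailedCheckpointTolerance {

    /** Proposed sentinel: tolerate any number of failed checkpoints. */
    static final int UNLIMITED = -1;

    private final int allowedFailedCheckpoints;
    private int consecutiveFailures = 0;

    FailedCheckpointTolerance(int allowedFailedCheckpoints) {
        if (allowedFailedCheckpoints < UNLIMITED) {
            throw new IllegalArgumentException(
                    "allowedFailedCheckpoints must be -1 (unlimited) or >= 0");
        }
        this.allowedFailedCheckpoints = allowedFailedCheckpoints;
    }

    /**
     * Records one failed checkpoint.
     *
     * @return true if the job may keep executing, false if it should
     *         fail and go through recovery
     */
    boolean onCheckpointFailed() {
        if (allowedFailedCheckpoints == UNLIMITED) {
            return true;
        }
        consecutiveFailures++;
        return consecutiveFailures <= allowedFailedCheckpoints;
    }

    /** Records a completed checkpoint and resets the failure count. */
    void onCheckpointCompleted() {
        consecutiveFailures = 0;
    }
}
```

With a limit of {{3}}, the first three failures in a row are tolerated and the fourth reports that the job should fail. With the default of {{0}}, the very first failure does so, which matches the existing behavior described above.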