Stephan Ewen created FLINK-4808:
-----------------------------------
Summary: Allow skipping failed checkpoints
Key: FLINK-4808
URL: https://issues.apache.org/jira/browse/FLINK-4808
Project: Flink
Issue Type: New Feature
Affects Versions: 1.1.2, 1.1.3
Reporter: Stephan Ewen
Fix For: 1.2.0
Currently, if Flink cannot complete a checkpoint, it results in a failure and
recovery.
To make the impact of less stable storage infrastructure on the performance of
Flink less severe, Flink should be able to tolerate a certain number of failed
checkpoints and simply keep executing.
This should be controllable via a parameter, for example:
{code}
env.getCheckpointConfig().setAllowedFailedCheckpoints(3);
{code}
A value of {{-1}} could indicate an infinite number of checkpoint failures
tolerated by Flink.
The default value should still be {{0}}, to keep compatibility with the
existing behavior.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)