[ 
https://issues.apache.org/jira/browse/FLINK-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15842945#comment-15842945
 ] 

Stephan Ewen commented on FLINK-4808:
-------------------------------------

[~ram_krish] I think #2 would be great to address first.

> Allow skipping failed checkpoints
> ---------------------------------
>
>                 Key: FLINK-4808
>                 URL: https://issues.apache.org/jira/browse/FLINK-4808
>             Project: Flink
>          Issue Type: New Feature
>    Affects Versions: 1.1.2, 1.1.3
>            Reporter: Stephan Ewen
>
> Currently, if Flink cannot complete a checkpoint, it results in a failure and 
> recovery.
> To make the impact of less stable storage infrastructure on the performance 
> of Flink less severe, Flink should be able to tolerate a certain number of 
> failed checkpoints and simply keep executing.
> This should be controllable via a parameter, for example:
> {code}
> env.getCheckpointConfig().setAllowedFailedCheckpoints(3);
> {code}
> A value of {{-1}} could indicate an infinite number of checkpoint failures 
> tolerated by Flink.
> The default value should still be {{0}}, to keep compatibility with the 
> existing behavior.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to