[ https://issues.apache.org/jira/browse/FLINK-3844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15263946#comment-15263946 ]

Aljoscha Krettek commented on FLINK-3844:
-----------------------------------------

+1, we could have something similar to {{RestartStrategy}} but for checkpoints that determines when failing checkpoints should crash a job.

> Checkpoint failures should not always lead to job failures
> ----------------------------------------------------------
>
>                 Key: FLINK-3844
>                 URL: https://issues.apache.org/jira/browse/FLINK-3844
>             Project: Flink
>          Issue Type: Improvement
>          Components: Streaming
>            Reporter: Gyula Fora
>
> Currently when a checkpoint fails the job crashes immediately. This is not
> the desired behaviour in many cases. It would probably be better to log the
> failed checkpoint attempt and only fail the job after so many subsequent
> failed attempts.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
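
To make the proposal above concrete, here is a rough sketch of what a pluggable policy for checkpoint failures might look like. This is not existing Flink API: the names {{CheckpointFailurePolicy}} and {{TolerateConsecutiveFailures}} are hypothetical and only illustrate the idea. It also assumes that "so many subsequent failed attempts" in the issue description means consecutive failures, so a completed checkpoint resets the counter.

```java
// Hypothetical sketch only: none of these types exist in Flink.

/** Decides whether a failed checkpoint should fail the job. */
interface CheckpointFailurePolicy {

    /** Records a failed checkpoint; returns true if the job should now be failed. */
    boolean onCheckpointFailure();

    /** Records a successfully completed checkpoint. */
    void onCheckpointSuccess();
}

/** Tolerates up to a fixed number of consecutive checkpoint failures. */
final class TolerateConsecutiveFailures implements CheckpointFailurePolicy {

    private final int maxConsecutiveFailures;
    private int consecutiveFailures;

    TolerateConsecutiveFailures(int maxConsecutiveFailures) {
        if (maxConsecutiveFailures < 0) {
            throw new IllegalArgumentException("maxConsecutiveFailures must be >= 0");
        }
        this.maxConsecutiveFailures = maxConsecutiveFailures;
    }

    @Override
    public boolean onCheckpointFailure() {
        // Count this failure; the caller would log it and carry on unless
        // the tolerated number of consecutive failures has been exceeded.
        consecutiveFailures++;
        return consecutiveFailures > maxConsecutiveFailures;
    }

    @Override
    public void onCheckpointSuccess() {
        // Assumption: only consecutive failures count, so a success resets the counter.
        consecutiveFailures = 0;
    }
}
```

With {{new TolerateConsecutiveFailures(0)}} the first failed checkpoint already fails the job, which matches the behaviour the issue describes as current. A larger value lets the job continue through isolated failures.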