[jira] [Commented] (FLINK-3844) Checkpoint failures should not always lead to job failures

Aljoscha Krettek (JIRA) Fri, 29 Apr 2016 04:51:06 -0700

    [ 
https://issues.apache.org/jira/browse/FLINK-3844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15263946#comment-15263946
 ]


Aljoscha Krettek commented on FLINK-3844:
-----------------------------------------

+1, we could have something similar to {{RestartStrategy}} but for checkpoints 
that determines when failing checkpoints should crash a job.

> Checkpoint failures should not always lead to job failures
> ----------------------------------------------------------
>
>                 Key: FLINK-3844
>                 URL: https://issues.apache.org/jira/browse/FLINK-3844
>             Project: Flink
>          Issue Type: Improvement
>          Components: Streaming
>            Reporter: Gyula Fora
>
> Currently when a checkpoint fails the job crashes immediately. This is not 
> the desired behaviour in many cases. It would probably be better to log the 
> failed checkpoint attempt and only fail the job after so many subsequent 
> failed attempts.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (FLINK-3844) Checkpoint failures should not always lead to job failures

Reply via email to