[
https://issues.apache.org/jira/browse/FLINK-4810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15894047#comment-15894047
]
ASF GitHub Bot commented on FLINK-4810:
---------------------------------------
Github user ramkrish86 commented on the issue:
https://github.com/apache/flink/pull/3334
@StephanEwen , @wenlong88 , @shixiaogang
Pls have a look at the latest push. Now I am tracking the failures in the
checkpointing and incrementing a new counter based on it. Added test cases
also.
I have not changed the constructors of the affected class because it
touches many files. I can update it based on the feedback of the latest PR.
> Checkpoint Coordinator should fail ExecutionGraph after "n" unsuccessful
> checkpoints
> ------------------------------------------------------------------------------------
>
> Key: FLINK-4810
> URL: https://issues.apache.org/jira/browse/FLINK-4810
> Project: Flink
> Issue Type: Sub-task
> Components: State Backends, Checkpointing
> Reporter: Stephan Ewen
>
> The Checkpoint coordinator should track the number of consecutive
> unsuccessful checkpoints.
> If more than {{n}} (configured value) checkpoints fail in a row, it should
> call {{fail()}} on the execution graph to trigger a recovery.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)