[ https://issues.apache.org/jira/browse/FLINK-4810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16716171#comment-16716171 ]
ASF GitHub Bot commented on FLINK-4810:
---------------------------------------
ramkrish86 commented on issue #3334: FLINK-4810 Checkpoint Coordinator should
fail ExecutionGraph after "n" unsuccessful checkpoints
URL: https://github.com/apache/flink/pull/3334#issuecomment-446070175
@azagrebin - Thanks for the ping. I am not currently working on this. Please
feel free to work on this or on the related JIRA issue FLINK-10074. I will add
myself as a watcher to understand more about it. Thanks once again.
> Checkpoint Coordinator should fail ExecutionGraph after "n" unsuccessful
> checkpoints
> ------------------------------------------------------------------------------------
>
> Key: FLINK-4810
> URL: https://issues.apache.org/jira/browse/FLINK-4810
> Project: Flink
> Issue Type: Sub-task
> Components: State Backends, Checkpointing
> Reporter: Stephan Ewen
> Priority: Major
> Labels: pull-request-available
>
> The Checkpoint coordinator should track the number of consecutive
> unsuccessful checkpoints.
> If more than {{n}} (configured value) checkpoints fail in a row, it should
> call {{fail()}} on the execution graph to trigger a recovery.
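
For illustration only, the behaviour requested above could be sketched roughly as follows. This is not the code from the pull request or Flink's actual CheckpointCoordinator API: the class name ConsecutiveFailureTracker, its methods, and the Runnable that stands in for calling fail() on the execution graph are all hypothetical. The sketch also assumes single-threaded access and assumes the counter is reset once the failure has been triggered.

```java
/**
 * Hypothetical sketch: counts consecutive failed checkpoints and triggers a
 * callback once more than "n" have failed in a row. Not Flink's real API.
 */
final class ConsecutiveFailureTracker {
    private final int tolerableFailures; // the configured "n"
    private final Runnable failJob;      // stand-in for fail() on the execution graph
    private int consecutiveFailures;     // assumes single-threaded access

    ConsecutiveFailureTracker(int tolerableFailures, Runnable failJob) {
        if (tolerableFailures < 0) {
            throw new IllegalArgumentException("tolerableFailures must be >= 0");
        }
        if (failJob == null) {
            throw new NullPointerException("failJob");
        }
        this.tolerableFailures = tolerableFailures;
        this.failJob = failJob;
    }

    /** A successful checkpoint breaks the run of failures. */
    void onCheckpointCompleted() {
        consecutiveFailures = 0;
    }

    /** Fails the job once more than "n" checkpoints have failed in a row. */
    void onCheckpointFailed() {
        consecutiveFailures++;
        if (consecutiveFailures > tolerableFailures) {
            // Assumption: start counting afresh after recovery is triggered.
            consecutiveFailures = 0;
            failJob.run();
        }
    }

    int getConsecutiveFailures() {
        return consecutiveFailures;
    }
}
```

With n = 2, two failures followed by a success leave the job running, while three failures in a row invoke the callback exactly once.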