azagrebin commented on issue #6567: [FLINK-10074] Allowable number of
checkpoint failures
URL: https://github.com/apache/flink/pull/6567#issuecomment-434255898
I created FLINK-10724 to refactor failure handling in the checkpoint
coordinator, where I believe we should first prepare it for bett
azagrebin commented on issue #6567: [FLINK-10074] Allowable number of
checkpoint failures
URL: https://github.com/apache/flink/pull/6567#issuecomment-434217958
Hi @yanghua,
can this PR be closed for now, so that we come back to it once we have a
design in the Jira issue?
We might
azagrebin commented on issue #6567: [FLINK-10074] Allowable number of
checkpoint failures
URL: https://github.com/apache/flink/pull/6567#issuecomment-428137039
Hi @yanghua,
In general, failing via `executionGraph.failGlobal` looks good to me, but I
think the `CheckpointFailureManager` s
azagrebin commented on issue #6567: [FLINK-10074] Allowable number of
checkpoint failures
URL: https://github.com/apache/flink/pull/6567#issuecomment-423942772
> Do we need to take this refactoring into account? Because this PR is
> actually a supplement to the checkpoint exception handler.
azagrebin commented on issue #6567: [FLINK-10074] Allowable number of
checkpoint failures
URL: https://github.com/apache/flink/pull/6567#issuecomment-423573937
Thanks for the update, @yanghua. Looking at the checkpoint coordinator more
deeply, I think we first have to work a bit more on