[
https://issues.apache.org/jira/browse/FLINK-24887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated FLINK-24887:
-----------------------------------
Labels: pull-request-available (was: )
> Retrying savepoints may cause early cluster shutdown
> ----------------------------------------------------
>
> Key: FLINK-24887
> URL: https://issues.apache.org/jira/browse/FLINK-24887
> Project: Flink
> Issue Type: Bug
> Components: Runtime / REST
> Affects Versions: 1.15.0
> Reporter: Chesnay Schepler
> Assignee: Chesnay Schepler
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.15.0
>
>
> If an operation is retried we potentially access the result of a previous
> attempt to see if it has already failed and eagerly fail the trigger request.
> If that attempt is already complete then this may lead to an unexpected
> shutdown of the cluster.
> Beyond this issue, the eager checking of previous attempts makes error
> handling more complicated, because you have to cover all cases for both the
> trigger and status-retrieval operations.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)