[jira] [Updated] (FLINK-24887) Retrying savepoints may cause early cluster shutdown

ASF GitHub Bot (Jira) Fri, 12 Nov 2021 03:24:07 -0800


     [ 
https://issues.apache.org/jira/browse/FLINK-24887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


ASF GitHub Bot updated FLINK-24887:
-----------------------------------
    Labels: pull-request-available  (was: )

> Retrying savepoints may cause early cluster shutdown
> ----------------------------------------------------
>
>                 Key: FLINK-24887
>                 URL: https://issues.apache.org/jira/browse/FLINK-24887
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / REST
>    Affects Versions: 1.15.0
>            Reporter: Chesnay Schepler
>            Assignee: Chesnay Schepler
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.15.0
>
>
> When an operation is retried, we may access the result of a previous 
> attempt to check whether it has already failed and, if so, eagerly fail the 
> trigger request. If that attempt has already completed, this may lead to an 
> unexpected shutdown of the cluster.
> Beyond this issue, eagerly checking previous attempts complicates error 
> handling, because all cases must be covered in both the trigger and the 
> status-retrieval operations.
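
To illustrate the error-handling point in the description, here is a minimal sketch of a design in which the trigger operation never inspects a previous attempt and failures surface only through status retrieval. It is not the change made for this issue: the class SavepointOperations, its methods, and the status strings are hypothetical names chosen for illustration and are not Flink APIs.

```java
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch; none of these names are Flink classes.
final class SavepointOperations {
    // One future per operation key; a retried trigger reuses the same key.
    private final Map<String, CompletableFuture<String>> attempts =
            new ConcurrentHashMap<>();

    /** Trigger: registers the operation if absent and always acknowledges. */
    String trigger(String key) {
        attempts.computeIfAbsent(key, k -> new CompletableFuture<>());
        // Deliberately no look at the result of a previous attempt here.
        return key;
    }

    /** Status retrieval: the only place where success or failure is reported. */
    String status(String key) {
        CompletableFuture<String> attempt = attempts.get(key);
        if (attempt == null) {
            return "UNKNOWN";
        }
        if (!attempt.isDone()) {
            return "IN_PROGRESS";
        }
        if (attempt.isCompletedExceptionally()) {
            return "FAILED";
        }
        return "COMPLETED: " + attempt.join();
    }

    // Hooks that stand in for the asynchronous savepoint finishing.
    void complete(String key, String path) {
        attempts.get(key).complete(path);
    }

    void fail(String key, Throwable cause) {
        attempts.get(key).completeExceptionally(cause);
    }
}
```

In this sketch a retried trigger for an already-failed operation is still acknowledged, and the client learns about the failure from the status call, so failure cases are handled in one operation instead of two.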



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
