[ 
https://issues.apache.org/jira/browse/FLINK-4201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15379639#comment-15379639
 ] 

Ufuk Celebi commented on FLINK-4201:
------------------------------------

The shutdown hook is not actually the problem, because it is only active in 
standalone recovery mode. The issue is that suspending an execution graph 
shuts down the checkpoint coordinator, which discards all checkpoints on 
shutdown. We still need to call shutdown in order to free resources such as 
the timer task, but we have to skip discarding the checkpoints if the 
execution graph is suspended and not in a globally terminal state.
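
A minimal sketch of the behaviour described above, not the actual Flink 
code: SketchJobStatus and SketchCheckpointCoordinator are hypothetical 
stand-ins, and the checkpoint store is reduced to a list of strings. The 
point is only that shutdown always frees the timer, while discarding 
checkpoints depends on whether the job status is globally terminal.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Timer;

// Hypothetical job status for this sketch; not Flink's own enum.
// SUSPENDED is deliberately marked as NOT globally terminal.
enum SketchJobStatus {
    RUNNING(false), SUSPENDED(false), FINISHED(true), CANCELED(true), FAILED(true);

    private final boolean globallyTerminal;

    SketchJobStatus(boolean globallyTerminal) {
        this.globallyTerminal = globallyTerminal;
    }

    boolean isGloballyTerminal() {
        return globallyTerminal;
    }
}

// Hypothetical coordinator: holds completed checkpoints and a timer.
class SketchCheckpointCoordinator {
    private final List<String> completedCheckpoints = new ArrayList<>();
    private final Timer timer = new Timer("checkpoint-timer", true);
    private boolean shutdown;

    void addCheckpoint(String handle) {
        completedCheckpoints.add(handle);
    }

    List<String> checkpoints() {
        return completedCheckpoints;
    }

    boolean isShutdown() {
        return shutdown;
    }

    // Always release resources; discard checkpoints only when the job
    // has reached a globally terminal state.
    void shutdown(SketchJobStatus status) {
        if (shutdown) {
            return;
        }
        shutdown = true;
        timer.cancel(); // free the timer task in every case

        if (status.isGloballyTerminal()) {
            completedCheckpoints.clear(); // job is done, checkpoints can go
        }
        // Otherwise (e.g. SUSPENDED) the checkpoints are kept for recovery.
    }
}
```

With this shape, a suspended job that calls shutdown(SketchJobStatus.SUSPENDED) 
still has its checkpoints afterwards, whereas a finished or cancelled job 
does not.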

> Checkpoints for jobs in non-terminal state (e.g. suspended) get deleted
> -----------------------------------------------------------------------
>
>                 Key: FLINK-4201
>                 URL: https://issues.apache.org/jira/browse/FLINK-4201
>             Project: Flink
>          Issue Type: Bug
>          Components: State Backends, Checkpointing
>            Reporter: Stefan Richter
>            Assignee: Ufuk Celebi
>            Priority: Blocker
>
> For example, when shutting down a YARN session, the logs show that 
> checkpoints for jobs that did not terminate are deleted. In the shutdown 
> hook, removeAllCheckpoints is called and removes checkpoints that should 
> still be kept.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
