Matthias Pohl created FLINK-26742:
-------------------------------------
Summary: DefaultCompletedCheckpointStore.shutdown does not clean
the checkpoints atomically
Key: FLINK-26742
URL: https://issues.apache.org/jira/browse/FLINK-26742
Project: Flink
Issue Type: Bug
Components: Runtime / Coordination
Affects Versions: 1.15.0
Reporter: Matthias Pohl
{{DefaultCompletedCheckpointStore.shutdown}} first removes the checkpoint entry
from the {{StateHandleStore}} and only runs the actual cleanup of the checkpoint
after the entry has been removed. If an error occurs while discarding the
{{CompletedCheckpoint}}, the entry is already lost, so the checkpoint is not
picked up again during a retry.
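
The ordering problem can be illustrated with a minimal, self-contained sketch. It does not use Flink's classes: {{Checkpoint}}, {{EntryStore}}, {{removeThenDiscard}} and {{discardThenRemove}} are hypothetical stand-ins, and the second ordering is only one possible way to keep the entry available for a retry, not necessarily the fix chosen for this issue.

```java
import java.util.HashMap;
import java.util.Map;

public class ShutdownOrderingSketch {

    /** Hypothetical stand-in for a completed checkpoint whose discard may fail. */
    interface Checkpoint {
        void discard() throws Exception;
    }

    /** Hypothetical stand-in for the store that holds the checkpoint entries. */
    static final class EntryStore {
        private final Map<Long, Checkpoint> entries = new HashMap<>();

        void add(long id, Checkpoint checkpoint) {
            entries.put(id, checkpoint);
        }

        Checkpoint get(long id) {
            return entries.get(id);
        }

        Checkpoint remove(long id) {
            return entries.remove(id);
        }

        boolean contains(long id) {
            return entries.containsKey(id);
        }
    }

    /** Ordering described in the report: the entry is removed before the discard runs. */
    static void removeThenDiscard(EntryStore store, long id) throws Exception {
        Checkpoint checkpoint = store.remove(id);
        if (checkpoint != null) {
            // If this throws, the entry is already gone and a retry cannot find it.
            checkpoint.discard();
        }
    }

    /** Alternative ordering (an assumption): the entry is removed only after a successful discard. */
    static void discardThenRemove(EntryStore store, long id) throws Exception {
        Checkpoint checkpoint = store.get(id);
        if (checkpoint != null) {
            // If this throws, the entry stays in the store and a retry can find it.
            checkpoint.discard();
            store.remove(id);
        }
    }

    public static void main(String[] args) {
        // A checkpoint whose discard always fails, to simulate the error case.
        Checkpoint failing = () -> {
            throw new Exception("simulated discard failure");
        };

        EntryStore first = new EntryStore();
        first.add(1L, failing);
        try {
            removeThenDiscard(first, 1L);
        } catch (Exception e) {
            // Expected: the simulated discard failure.
        }
        System.out.println("remove-then-discard keeps entry: " + first.contains(1L)); // false

        EntryStore second = new EntryStore();
        second.add(1L, failing);
        try {
            discardThenRemove(second, 1L);
        } catch (Exception e) {
            // Expected: the simulated discard failure.
        }
        System.out.println("discard-then-remove keeps entry: " + second.contains(1L)); // true
    }
}
```

With the first ordering the failed checkpoint is no longer reachable through the store after the error, which matches the behavior described above. With the second ordering the entry survives the failure and a later retry can attempt the discard again.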
--
This message was sent by Atlassian Jira
(v8.20.1#820001)