Zakelly Lan created FLINK-38347:
-----------------------------------
Summary: Checkpoint file-merging manager may delete the directory
unexpectedly when some RPC messages lost
Key: FLINK-38347
URL: https://issues.apache.org/jira/browse/FLINK-38347
Project: Flink
Issue Type: Bug
Affects Versions: 1.20.2, 2.0.0, 2.1.0
Reporter: Zakelly Lan
Assignee: Zakelly Lan
In FLINK-32086, we delete the orphan directories created by file-merging
manager. The orphan check depends on the checkpoint notifications. So we should
tolerate rpc messages lost, but current implementation using reference counting
which does not verify the message completeness using checkpoint id. That may
cause unexpected directory deletion, although this is rare.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)