Surya Prasanna Yalla created HUDI-6351:
------------------------------------------

             Summary: Long standing replace inflights post completion can be 
archived before cleaning for KEEP_LATEST_FILE_VERSIONS cleaner policy.
                 Key: HUDI-6351
                 URL: https://issues.apache.org/jira/browse/HUDI-6351
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: Surya Prasanna Yalla


For KEEP_LATEST_FILE_VERSIONS cleaner policy, file versions are only maintained 
for active file groups not for replaced file groups.

Since, earliestCommitToRetain is null for KEEP_LATEST_FILE_VERSIONS policy, 
last clean instant can be considered as a lower bound, since the cleaner would 
have removed all the file groups until then. But there is a catch to this 
logic, while cleaner is running if there is a pending replacecommit then those 
files are not cleaned.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to