[
https://issues.apache.org/jira/browse/HUDI-6351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bibhu Pala reassigned HUDI-6351:
--------------------------------
Assignee: (was: Bibhu Pala)
> Long standing replace inflights post completion can be archived before
> cleaning for KEEP_LATEST_FILE_VERSIONS cleaner policy.
> -----------------------------------------------------------------------------------------------------------------------------
>
> Key: HUDI-6351
> URL: https://issues.apache.org/jira/browse/HUDI-6351
> Project: Apache Hudi
> Issue Type: Bug
> Components: cleaning, table-service
> Reporter: Surya Prasanna Yalla
> Priority: Major
>
> For KEEP_LATEST_FILE_VERSIONS cleaner policy, file versions are only
> maintained for active file groups not for replaced file groups.
> Since, earliestCommitToRetain is null for KEEP_LATEST_FILE_VERSIONS policy,
> last clean instant can be considered as a lower bound, since the cleaner
> would have removed all the file groups until then. But there is a catch to
> this logic, while cleaner is running if there is a pending replacecommit then
> those files are not cleaned.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)