[ 
https://issues.apache.org/jira/browse/HUDI-6351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bibhu Pala reassigned HUDI-6351:
--------------------------------

    Assignee: Bibhu Pala

> Long standing replace inflights post completion can be archived before 
> cleaning for KEEP_LATEST_FILE_VERSIONS cleaner policy.
> -----------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HUDI-6351
>                 URL: https://issues.apache.org/jira/browse/HUDI-6351
>             Project: Apache Hudi
>          Issue Type: Bug
>          Components: cleaning, table-service
>            Reporter: Surya Prasanna Yalla
>            Assignee: Bibhu Pala
>            Priority: Major
>
> For the KEEP_LATEST_FILE_VERSIONS cleaner policy, file versions are 
> maintained only for active file groups, not for replaced file groups.
> Since earliestCommitToRetain is null for this policy, the last clean 
> instant can be treated as a lower bound for archival, because the cleaner 
> would have removed all the file groups up to that instant. There is a catch 
> to this logic, however: if a replacecommit is still pending while the 
> cleaner runs, the files it replaces are not cleaned. Once that replacecommit 
> completes, it can be archived before a later clean has removed those files.
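
The catch described above can be illustrated with a small, self-contained sketch. This is not Hudi code: the `ReplaceCommit` class, the `naive_lower_bound` and `safe_lower_bound` functions, and the instant values are all hypothetical, and the "safe" bound is only one possible way to guard against the problem, assuming instant times compare lexicographically.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ReplaceCommit:
    """Hypothetical model of a replacecommit on the timeline."""
    instant: str                          # instant time, ordered lexicographically
    replaced_files_cleaned: bool = False  # True once a clean removed the replaced files


def naive_lower_bound(last_clean_instant: str) -> str:
    # Assumes the cleaner handled everything up to the last clean instant.
    return last_clean_instant


def safe_lower_bound(last_clean_instant: str,
                     replace_commits: List[ReplaceCommit]) -> str:
    # Hypothetical guard: never move the bound past a replacecommit whose
    # replaced files are still waiting to be cleaned.
    uncleaned = [rc.instant for rc in replace_commits
                 if not rc.replaced_files_cleaned]
    return min([last_clean_instant] + uncleaned)


def is_archivable(instant: str, lower_bound: str) -> bool:
    # Instants strictly older than the lower bound may be archived.
    return instant < lower_bound


# Scenario: replacecommit "001" was still pending when the clean at "005"
# ran, so its replaced files were skipped. It completed afterwards.
long_running = ReplaceCommit(instant="001", replaced_files_cleaned=False)
last_clean = "005"

naive = naive_lower_bound(last_clean)
safe = safe_lower_bound(last_clean, [long_running])

naive_result = is_archivable(long_running.instant, naive)
safe_result = is_archivable(long_running.instant, safe)
```

With the naive bound, the replacecommit at "001" counts as archivable even though its replaced files were never cleaned. With the hypothetical guarded bound, it stays on the active timeline until a later clean marks those files as removed.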



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
