[ 
https://issues.apache.org/jira/browse/HUDI-6252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-6252:
----------------------------
    Component/s: cleaning

> Deduce archival config based on hours derived from cleaner hours based config
> -----------------------------------------------------------------------------
>
>                 Key: HUDI-6252
>                 URL: https://issues.apache.org/jira/browse/HUDI-6252
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: archiving, cleaning, table-service
>            Reporter: sivabalan narayanan
>            Priority: Major
>
> We have hours-based policies/configs for cleaning, but archival only has 
> num_commits-based configs. So, as of now, the user/dev has to ensure that 
> the two align with each other. We should derive archival configs 
> automatically from the cleaner configs. 
>  
> For example, 
> if the cleaner config is 48 hours, 
> we can add 10% more for archival min commits and 20% for max commits. 
> So, the archival window can be 53 hours -> 58 hours (48 * 1.1 = 52.8 and 
> 48 * 1.2 = 57.6, each rounded up). We can expose configs for 
> these percentages if need be. 
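
The arithmetic in the example can be sketched as follows. This is only an illustration of the proposal, not Hudi code: the function name, its parameters, and the choice to round up to whole hours are assumptions made for the sketch, and translating the resulting hours into commit counts is left out.

```python
import math


def derive_archival_window_hours(cleaner_hours, min_pct=10.0, max_pct=20.0):
    """Derive an archival window, in hours, from the cleaner retention hours.

    Hypothetical helper: the name, the parameters, and the round-up rule are
    assumptions for illustration, not part of Hudi.
    """
    if cleaner_hours <= 0:
        raise ValueError("cleaner_hours must be positive")
    if not 0 <= min_pct <= max_pct:
        raise ValueError("expected 0 <= min_pct <= max_pct")

    # Round up so the archival window never starts before the cleaner window ends.
    min_hours = math.ceil(cleaner_hours * (100 + min_pct) / 100)
    max_hours = math.ceil(cleaner_hours * (100 + max_pct) / 100)
    return min_hours, max_hours


if __name__ == "__main__":
    # The case from the description: 48 cleaner hours with 10% and 20% headroom.
    print(derive_archival_window_hours(48))
```

Keeping the two percentages as parameters mirrors the suggestion that they could be exposed as configs.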



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
