[
https://issues.apache.org/jira/browse/HUDI-349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sivabalan narayanan updated HUDI-349:
-------------------------------------
Labels: core-flow-ds pull-request-available sev:high (was:
pull-request-available)
> Make cleaner retention based on time period to account for higher deviations
> in ingestion runs
> ----------------------------------------------------------------------------------------------
>
> Key: HUDI-349
> URL: https://issues.apache.org/jira/browse/HUDI-349
> Project: Apache Hudi
> Issue Type: Task
> Components: Cleaner, newbie, Writer Core
> Reporter: Balaji Varadarajan
> Assignee: Pratyaksh Sharma
> Priority: Major
> Labels: core-flow-ds, pull-request-available, sev:high
>
> Cleaner by commits is based on number of commits to be retained. Ingestion
> time could vary across runs due to various factors. For providing a bound on
> the maximum running time for a query and for providing consistent retention
> period, it is better to use a retention config based on time (e:g 12h)
--
This message was sent by Atlassian Jira
(v8.20.1#820001)