[
https://issues.apache.org/jira/browse/HUDI-349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sivabalan narayanan updated HUDI-349:
-------------------------------------
Sprint: Cont' improve - 2022/02/07, Cont' improve - 2022/02/14 (was:
Cont' improve - 2022/02/07)
> Make cleaner retention based on time period to account for higher deviations
> in ingestion runs
> ----------------------------------------------------------------------------------------------
>
> Key: HUDI-349
> URL: https://issues.apache.org/jira/browse/HUDI-349
> Project: Apache Hudi
> Issue Type: Task
> Components: cleaning, writer-core
> Reporter: Balaji Varadarajan
> Assignee: sivabalan narayanan
> Priority: Major
> Labels: core-flow-ds, new-to-hudi, pull-request-available,
> sev:high
> Fix For: 0.11.0
>
> Original Estimate: 2h
> Remaining Estimate: 2h
>
> Cleaner by commits is based on number of commits to be retained. Ingestion
> time could vary across runs due to various factors. For providing a bound on
> the maximum running time for a query and for providing consistent retention
> period, it is better to use a retention config based on time (e:g 12h)
--
This message was sent by Atlassian Jira
(v8.20.1#820001)