[ 
https://issues.apache.org/jira/browse/HUDI-8077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Danny Chen updated HUDI-8077:
-----------------------------
    Description: 
Currently, incremental cleaning remembers a marker instant (the last retained 
one) in the commit metadata. Both the marker and the filtering instant on the 
fs view are start times (instant times). This is fine in most cases because 
there is some buffer for cleaning (30 commits retained by default), but if the 
user sets up a very radical strategy, such as cleaning after every commit, 
there might be issues in NB-CC mode:

An instant that starts very early but finishes recently might be skipped by the 
cleaning table service.
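
The scenario can be illustrated with a small toy model. This is only a sketch 
and not Hudi code: the `Instant` class, the two filter functions, and the 
integer timestamps are all hypothetical and exist purely for illustration. It 
assumes a clean ran when only the `fast` instant had completed, so `fast` 
became the marker.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Instant:
    """Hypothetical stand-in for a timeline instant (not a Hudi class)."""
    name: str
    start: int       # instant time, i.e. when the write started
    completion: int  # when the write finished


def pending_by_start_time(instants, marker):
    """Instants the next clean considers when filtering on start time."""
    return [i for i in instants if i.start > marker.start]


def pending_by_completion_time(instants, marker):
    """Instants the next clean considers when filtering on completion time."""
    return [i for i in instants if i.completion > marker.completion]


# 'slow' starts before 'fast' but completes after it, which can happen
# when writers do not block each other.
slow = Instant("slow", start=1, completion=6)
fast = Instant("fast", start=2, completion=3)
late = Instant("late", start=5, completion=7)
instants = [slow, fast, late]

# Assumption: the previous clean ran at time 4, when only 'fast' was complete.
marker = fast

skipped = [i.name for i in instants
           if i not in pending_by_start_time(instants, marker) and i != marker]
print("start-time filter:     ", [i.name for i in pending_by_start_time(instants, marker)])
print("completion-time filter:", [i.name for i in pending_by_completion_time(instants, marker)])
print("skipped by start-time filter:", skipped)
```

With the start-time filter, `slow` is never picked up because its start time 
is earlier than the marker's, even though it completed after the previous 
clean. Filtering on completion time picks up both `slow` and `late`.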

> Fix the incremental cleaning to base on completion time
> -------------------------------------------------------
>
>                 Key: HUDI-8077
>                 URL: https://issues.apache.org/jira/browse/HUDI-8077
>             Project: Apache Hudi
>          Issue Type: Sub-task
>            Reporter: Danny Chen
>            Priority: Major
>             Fix For: 1.0.0
>
>
> Currently, incremental cleaning remembers a marker instant (the last retained 
> one) in the commit metadata. Both the marker and the filtering instant on the 
> fs view are start times (instant times). This is fine in most cases because 
> there is some buffer for cleaning (30 commits retained by default), but if the 
> user sets up a very radical strategy, such as cleaning after every commit, 
> there might be issues in NB-CC mode:
> An instant that starts very early but finishes recently might be skipped by 
> the cleaning table service.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
