Raymond Xu created HUDI-7226:
--------------------------------

             Summary: Clean by hour does not respect 
lastVersionBeforeEarliestCommitToRetain
                 Key: HUDI-7226
                 URL: https://issues.apache.org/jira/browse/HUDI-7226
             Project: Apache Hudi
          Issue Type: Improvement
          Components: cleaning
            Reporter: Raymond Xu
             Fix For: 0.12.4, 0.14.1


org.apache.hudi.table.action.clean.CleanPlanner#getFilesToCleanKeepingLatestCommits(java.lang.String,
 int, org.apache.hudi.common.model.HoodieCleaningPolicy)

lastVersionBeforeEarliestCommitToRetain is not honored by KEEP_LATEST_BY_HOURS 
policy. This essentially makes cleaner to remove the file slice when it becomes 
non-latest, regardless of the intended retention period.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to