nsivabalan commented on a change in pull request #3646:
URL: https://github.com/apache/hudi/pull/3646#discussion_r790394846



##########
File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieCompactionConfig.java
##########
@@ -76,6 +76,11 @@
       .withDocumentation("Number of commits to retain, without cleaning. This will be retained for num_of_commits * time_between_commits "
           + "(scheduled). This also directly translates into how much data retention the table supports for incremental queries.");
 
+  public static final ConfigProperty<String> CLEANER_HOURS_RETAINED = ConfigProperty.key("hoodie.cleaner.hours.retained")
+          .defaultValue("24")
+          .withDocumentation("Number of hours for which commits need to be retained. This config provides a more flexible option as"
+          + "compared to number of commits retained for cleaning service");

Review comment:
       I feel this policy is not standalone. For example, numCommitsRetained 
and numFileVersionsRetained are self-contained. But here we follow the same 
logic as for retaining a number of commits: with this new policy we clean up 
data files based on commits, not file versions. If you think about it, in 
addition to configuring hours retained, users probably also have to choose 
whether they want to clean up files based on commits or on file versions. 
   
   Let me know what you think. 
   Either way, we need to add more documentation here. 
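   
   To illustrate the concern, here is a minimal sketch of how an hours-based window reduces to a set of retained commits, which is why it behaves like the commits-based policy rather than the file-versions one. `HoursRetainedSketch` and `commitsToRetain` are hypothetical names for illustration only, not Hudi's cleaner code, and commit times are modeled as plain `java.time.Instant` values, which is an assumption.

```java
import java.time.Duration;
import java.time.Instant;
import java.util.ArrayList;
import java.util.List;

public class HoursRetainedSketch {

  // Hypothetical helper, not a Hudi API: returns the commit times that fall
  // inside the retention window [now - hoursRetained, now]. Commits older
  // than the cutoff would become candidates for cleaning.
  static List<Instant> commitsToRetain(List<Instant> commitTimes, Instant now, int hoursRetained) {
    Instant cutoff = now.minus(Duration.ofHours(hoursRetained));
    List<Instant> retained = new ArrayList<>();
    for (Instant commitTime : commitTimes) {
      // A commit exactly at the cutoff is kept.
      if (!commitTime.isBefore(cutoff)) {
        retained.add(commitTime);
      }
    }
    return retained;
  }

  public static void main(String[] args) {
    Instant now = Instant.parse("2022-01-24T12:00:00Z");
    List<Instant> commits = List.of(
        Instant.parse("2022-01-22T12:00:00Z"),
        Instant.parse("2022-01-23T06:00:00Z"),
        Instant.parse("2022-01-23T18:00:00Z"),
        Instant.parse("2022-01-24T09:00:00Z"));
    // With a 24-hour window, only commits at or after 2022-01-23T12:00:00Z are kept.
    System.out.println(commitsToRetain(commits, now, 24));
  }
}
```

   In this sketch the hours setting only decides which commits survive; the choice between cleaning by commits and cleaning by file versions is a separate decision that it does not express.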




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.