TheR1sing3un commented on issue #12619:
URL: https://github.com/apache/hudi/issues/12619#issuecomment-2586467186

   > 6. After compaction, is it expected that each bucket contains only one 
file? Is there a way to retain more historical files?
   
   The `compaction` job simply creates a new file group version for the file 
groups in the merge plan, and the retention of the file group version depends 
on the policy of the `clean` job. Your current clean policy is 
`KEEP_LATEST_COMMITS `, which is to make a clean judgment based on the number 
of commit. I guess what you need is `KEEP_LATEST_FILE_VERSIONS `, which is to 
reserve the file group version, please refer to the official document: 
https://hudi.apache.org/docs/cleaning for details
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to