TheR1sing3un commented on issue #12619: URL: https://github.com/apache/hudi/issues/12619#issuecomment-2586467186
> 6. After compaction, is it expected that each bucket contains only one file? Is there a way to retain more historical files? The `compaction` job simply creates a new file group version for the file groups in the merge plan, and the retention of the file group version depends on the policy of the `clean` job. Your current clean policy is `KEEP_LATEST_COMMITS `, which is to make a clean judgment based on the number of commit. I guess what you need is `KEEP_LATEST_FILE_VERSIONS `, which is to reserve the file group version, please refer to the official document: https://hudi.apache.org/docs/cleaning for details -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
