prashantwason opened a new pull request, #8496: URL: https://github.com/apache/hudi/pull/8496
[HUDI-6101] Use UnBoundedCompactionStrategy for MDT compactions. ### Change Logs Changed the MDT compaction strategy to UnBoundedCompactionStrategy. Compaction on metadata table is used as a barrier for archiving on main dataset and for validating the deltacommits having corresponding completed commits. Therefore, we need to compact all fileslices of all partitions together requiring UnBoundedCompactionStrategy. ### Impact Corrects compaction on MDT when large amount of log data is to be compacted. Without this change, the current default strategy (LogFileSizeBasedCompactionStrategy) may only compact some file groups. ### Risk level (write none, low medium or high below) None ### Documentation Update None ### Contributor's checklist - [ ] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute) - [ ] Change Logs and Impact were stated clearly - [ ] Adequate tests were added if applicable - [ ] CI passed -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
