kamronis opened a new pull request, #17607: URL: https://github.com/apache/hudi/pull/17607
### Describe the issue this Pull Request addresses If MOR CDC table has compaction scheduled, then incremental read will result in incorrect data. This happens because commits metadata after compaction commit does not carry info about previous commits. Easily reproduced if you remove COMPACTION_SCHEDULE_ENABLED from testChangelogCompactionSchedule. But I also wrote another test for it. ### Summary and Changelog Add files from compaction commit to dependent logfiles list in HoodieCDCExtrator, if compaction is older then instant and is unfinished. ### Impact Fixes bug to read from table MOR CDC with scheduled compaction correctly ### Risk Level None ### Documentation Update ### Contributor's checklist - [ ] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute) - [ ] Enough context is provided in the sections above - [ ] Adequate tests were added if applicable -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
