yihua opened a new pull request, #10011: URL: https://github.com/apache/hudi/pull/10011
### Change Logs This PR fixes the logic of merging partial updates with projected reader schema, i.e., the reader schema contains a subset of fields from the table schema based on the query. - When processing log records in `HoodieBaseFileGroupRecordBuffer#doProcessNextDataRecord`, the schema of the combined record is also updated in the metadata since the schema can change due to partial merging; - A bug of getting the field values from the older record in `SparkRecordMergingUtils#mergePartialRecords` is fixed. - The partial update tests in `TestPartialUpdateForMergeInto` are enhanced to cover partial merging logic. ### Impact Makes sure the partial merging logic is correct. ### Risk level low ### Documentation Update N/A ### Contributor's checklist - [ ] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute) - [ ] Change Logs and Impact were stated clearly - [ ] Adequate tests were added if applicable - [ ] CI passed -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
