vortual opened a new pull request, #7795: URL: https://github.com/apache/hudi/pull/7795
### Change Logs BulkInsert 新增一个根据主键排序的选项:WRITE_BULK_INSERT_SORT_INPUT_BY_RECORD_KEY BulkInsert adds an option to sort by primary key: WRITE_BULK_INSERT_SORT_INPUT_BY_RECORD_KEY ### Impact 数据根据主键排序的好处:后续新增数据时 flink 需要扫描的文件会变少,使用内存也会降低 The advantage of sorting data according to the primary key: Flink can scan fewer files when new data is added later, and the memory usage will also be reduced ### Risk level (write none, low medium or high below) none ### Documentation Update _Describe any necessary documentation update if there is any new feature, config, or user-facing change_ N/A ### Contributor's checklist - [ ] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute) - [ ] Change Logs and Impact were stated clearly - [ ] Adequate tests were added if applicable - [ ] CI passed -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
