vortual opened a new pull request, #7795:
URL: https://github.com/apache/hudi/pull/7795

   ### Change Logs
   BulkInsert adds an option to sort the input by record key (primary key): 
`WRITE_BULK_INSERT_SORT_INPUT_BY_RECORD_KEY`
   
   ### Impact
   Benefit of sorting the data by primary key: when new data is added later, 
Flink needs to scan fewer files, and memory usage is reduced as well.
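
A rough illustration of why key-sorted input can reduce the number of files touched later. This is a standalone sketch and not Hudi or Flink code: it assumes files can be skipped by comparing a record key against each file's min/max key range, and the helper names `write_files` and `files_to_scan`, the file size, and the probe key are all made up for the example.

```python
import random


def write_files(keys, file_size):
    """Split keys into consecutive 'files' and return each file's (min, max) key."""
    files = [keys[i:i + file_size] for i in range(0, len(keys), file_size)]
    return [(min(f), max(f)) for f in files]


def files_to_scan(file_ranges, key):
    """Count files whose [min, max] key range could contain `key`."""
    return sum(1 for lo, hi in file_ranges if lo <= key <= hi)


# Hypothetical data set: 10,000 unique integer record keys in random order.
rng = random.Random(0)
keys = list(range(10_000))
rng.shuffle(keys)

# Bulk insert without sorting vs. with sorting by record key.
unsorted_ranges = write_files(keys, file_size=1_000)
sorted_ranges = write_files(sorted(keys), file_size=1_000)

# A later write for one key has to consider every file whose range covers it.
probe = 4_242
unsorted_hits = files_to_scan(unsorted_ranges, probe)
sorted_hits = files_to_scan(sorted_ranges, probe)

print("files to scan without sorting:", unsorted_hits)
print("files to scan with sorting:", sorted_hits)
```

With unique keys and sorted input the key ranges of the files do not overlap, so a lookup falls into a single file; with unsorted input most files cover almost the whole key space. How much of this carries over to a real table depends on how the writer actually locates existing records.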
   
   ### Risk level (write none, low, medium or high below)
   
   none
   
   ### Documentation Update
   
   _Describe any necessary documentation update if there is any new feature, 
config, or user-facing change_
   
   N/A
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
