JingsongLi opened a new pull request, #5787:
URL: https://github.com/apache/paimon/pull/5787

   
   <!-- Please specify the module before the PR name: [core] ... or [flink] ... 
-->
   
   ### Purpose
   
   <!-- Linking this pull request to the issue -->
   At present, each write operator initializes every time a new bucket appears, 
and the initialization process restores the historical data files. Each bucket 
performs this process once, which is very wasteful and results in a large 
amount of duplicate IO.
   
   This PR uses Flink's operator coordinator capability to make this restore a 
global single point execution, so that we can add cache in the coordinator to 
optimize the issue of duplicate IO.
   
   <!-- What is the purpose of the change -->
   
   ### Tests
   
   <!-- List UT and IT cases to verify this change -->
   
   ### API and Format
   
   <!-- Does this change affect API or storage format -->
   
   ### Documentation
   
   <!-- Does this change introduce a new feature -->
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@paimon.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to