zhongqishang opened a new issue, #2553:
URL: https://github.com/apache/amoro/issues/2553

   ### Search before asking
   
   - [X] I have searched in the 
[issues](https://github.com/NetEase/amoro/issues?q=is%3Aissue) and found no 
similar issues.
   
   
   ### What would you like to be improved?
   
   For large tables written by Flink, each commit adds an EQ DELETE 
(equality delete) file that applies to all previously written data files. As a 
result, most of the generated optimizing tasks read the same EQ DELETE file 
again and again, which causes redundant IO.
   
   ### How should we improve?
   
   Each JVM in the Optimizer (TaskManager, executor) should maintain a cache 
of EQ DELETE files, so that tasks running in the same JVM do not re-read the 
same file.
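
The proposal could look roughly like the sketch below: a small, bounded, thread-safe LRU cache keyed by file path that loads each EQ DELETE file at most once while it stays cached. This is only an illustration under assumptions. The class name `EqDeleteFileCache`, the `loader` callback, and bounding by entry count are hypothetical and not part of Amoro's existing code; a real implementation would more likely bound the cache by bytes and use the project's own file readers.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Function;

/** Hypothetical cache of decoded EQ DELETE file contents, keyed by file path. */
final class EqDeleteFileCache<V> {
  // Access-ordered map: the least recently used entry is the eldest one.
  private final Map<String, V> entries;
  private int loads = 0;

  EqDeleteFileCache(int maxEntries) {
    this.entries = new LinkedHashMap<String, V>(16, 0.75f, true) {
      @Override
      protected boolean removeEldestEntry(Map.Entry<String, V> eldest) {
        // Evict the least recently used file once the bound is exceeded.
        return size() > maxEntries;
      }
    };
  }

  /** Returns the cached content, or reads it through the loader on a miss. */
  synchronized V get(String path, Function<String, V> loader) {
    V cached = entries.get(path);
    if (cached != null) {
      return cached;
    }
    // Loading under the lock keeps the sketch simple: each file is read once,
    // at the cost of serializing concurrent misses.
    V loaded = loader.apply(path);
    loads++;
    entries.put(path, loaded);
    return loaded;
  }

  /** Number of times the loader was actually invoked (cache misses). */
  synchronized int loadCount() {
    return loads;
  }
}
```

To share it across all tasks in one JVM, a single instance could be held in a static field; tasks would then call `get(path, loader)` instead of opening the delete file directly.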
   
   ### Are you willing to submit PR?
   
   - [x] Yes I am willing to submit a PR!
   
   ### Subtasks
   
   _No response_
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.