TheR1sing3un commented on issue #12139:
URL: https://github.com/apache/hudi/issues/12139#issuecomment-2449436800
> We can get the accurate size of a file slice right, can that be utilitized
for optimization?
Of course. However, the execution efficiency of the task is not only
affected by the size of all the files in slice, but also by the data repetition
rate, slice layout and other factors. We can find the experience value of
common scenarios through testing, and use it as the default optimization
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]