aokolnychyi commented on issue #77: Include the cost to open a file during split planning URL: https://github.com/apache/incubator-iceberg/issues/77#issuecomment-466341221 Exactly, it is hard to estimate what would perform better as it would highly depend on a particular use case. Let's go for the min file weight. It is simple, it will prevent Iceberg from putting a huge number of files into one bin and the bin size will be closer to the target size in case of average files. Let me submit a PR.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
