szehon-ho commented on PR #7430: URL: https://github.com/apache/iceberg/pull/7430#issuecomment-1528384337
I think one use case for different split size, is the more advanced GDPR use cases (row level delete). If you have completely optimized the job to do a broadcast join (no shuffle), you can control the number of written files by split size. This may be different than doing a regular read where performance is the main concern. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
