[GitHub] [iceberg] szehon-ho commented on pull request #7430: Allow sparksql to override target split size with session property

via GitHub Fri, 28 Apr 2023 18:19:42 -0700


szehon-ho commented on PR #7430:
URL: https://github.com/apache/iceberg/pull/7430#issuecomment-1528384337


   I think one use case for a different split size is the more advanced GDPR 
use cases (row-level deletes).  
   
   If you have fully optimized the job to use a broadcast join (no shuffle), 
each read task writes its own output, so the split size controls the number of 
files written.  The right value here may differ from the one for a regular 
read, where performance is the main concern.
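
As a rough illustration of the point above, here is a minimal sketch of how the split size maps to the number of written files when there is no shuffle. It assumes each split becomes one task and each task writes exactly one file, and it ignores split bin-packing, per-file open cost, and partition boundaries. The function name and the sizes are hypothetical, not taken from the PR.

```python
def estimated_output_files(scanned_bytes: int, split_size_bytes: int) -> int:
    """Estimate files written by a shuffle-free job (assumes one file per split)."""
    if scanned_bytes < 0 or split_size_bytes <= 0:
        raise ValueError("scanned_bytes must be >= 0 and split_size_bytes > 0")
    # Integer ceiling division: a partial trailing split still yields a task.
    return -(-scanned_bytes // split_size_bytes)


MIB = 1024 * 1024
scanned = 10 * 1024 * MIB  # hypothetical 10 GiB of data rewritten by the delete

for split_mib in (128, 512):
    files = estimated_output_files(scanned, split_mib * MIB)
    print(f"split size {split_mib} MiB -> about {files} files")
```

Under these assumptions, a larger split size for the delete job yields fewer, larger files, while a regular read might prefer smaller splits for more parallelism.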


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


