bowenliang123 commented on PR #7932:
URL: https://github.com/apache/iceberg/pull/7932#issuecomment-1621211436

   > @bowenliang123 @ConeyLiu i understand REBALANCE_PARTITIONS_BY_COL this 
adds a adaptive coalesce(AQE) which just coalesces the partitions local to 
executor( hence reducing number of files written) . Is this effective if the 
partitions are spread across different workers since the partitions wont be 
local anymore( for coalesce to work) ?
   
   <img width="427" alt="image" 
src="https://github.com/apache/iceberg/assets/1935105/04947fd7-425c-4253-a20e-eaddf46afdbe";>
   Since RebalancePartitions is introduced, a shuffle read stage was 
introduced. So I think it works for partitions across worker nodes.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to