[GitHub] [spark] ulysses-you commented on pull request #32683: [SPARK-35540][SQL] Make config maxShuffledHashJoinLocalMapThreshold fallback to advisoryPartitionSizeInBytes

GitBox Tue, 01 Jun 2021 07:49:29 -0700


ulysses-you commented on pull request #32683:
URL: https://github.com/apache/spark/pull/32683#issuecomment-852188051



   @maropu  After taking a deep look, I think it's about compression ratio that 
the extreme case @JkSelf  metioned above.
   
   Here is some pictures of `q72`.  We can see the 60MB data contains 
4.9million rows.
   
   
![image](https://user-images.githubusercontent.com/12025282/120342780-b1905100-c32a-11eb-8156-32a0e8edf199.png)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] ulysses-you commented on pull request #32683: [SPARK-35540][SQL] Make config maxShuffledHashJoinLocalMapThreshold fallback to advisoryPartitionSizeInBytes

Reply via email to