[GitHub] [spark] maropu commented on issue #28123: [SPARK-31350][SQL] Coalesce bucketed tables for join if applicable

GitBox Thu, 09 Apr 2020 19:21:46 -0700

maropu commented on issue #28123: [SPARK-31350][SQL] Coalesce bucketed tables 
for join if applicable
URL: https://github.com/apache/spark/pull/28123#issuecomment-611844105
 
 
   > Does this optimization always work well? e.g., 100000 buckets vs. 2 buckets.
   >> Not always, so this is enabled if spark.sql.bucketing.coalesce is set to true.
   
   If so, I think we need a threshold config to turn off this optimization when 
the bucket counts of the joined tables differ by a large factor.
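
A minimal sketch of the kind of threshold check suggested above, written in Python for illustration only. The function name `should_coalesce` and the `max_bucket_ratio` parameter are hypothetical and are not taken from the PR; the sketch also assumes that coalescing is only considered when the larger bucket count is a multiple of the smaller one.

```python
def should_coalesce(num_buckets_left: int, num_buckets_right: int,
                    max_bucket_ratio: int) -> bool:
    """Decide whether to coalesce the larger bucketed table down to the
    smaller bucket count (hypothetical helper, not Spark's implementation)."""
    small, large = sorted((num_buckets_left, num_buckets_right))
    # Nothing to coalesce when the counts match or are invalid.
    if small <= 0 or small == large:
        return False
    # Assumption: coalescing only applies when one count divides the other.
    if large % small != 0:
        return False
    # The proposed threshold: skip the optimization when the gap is too large,
    # since collapsing many buckets into few reduces join parallelism.
    return large // small <= max_bucket_ratio
```

With a hypothetical threshold of 4, the 100000-vs-2 case from the quoted question would be rejected, while 8 vs. 4 buckets would still be coalesced.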



