somani commented on PR #35686:
URL: https://github.com/apache/spark/pull/35686#issuecomment-1107807027

   @c21 @cloud-fan This change has caused a performance regression in our tests: 
broadcast hash join is 5x slower.
   It can be reproduced easily on TPC-DS 3 TB data with the following query:
   ```sql
   select sum(ws_ext_sales_price) sun_sales, count(*)
   from web_sales, date_dim
   where ws_sold_date_sk = d_date_sk
   ```
   
   I could not figure out why it causes the regression, but the slowdown clearly 
goes away when the commit is reverted.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

