estoianovici opened a new issue, #6789:
URL: https://github.com/apache/incubator-gluten/issues/6789

   ### Backend
   
   VL (Velox)
   
   ### Bug description
   
   There is a significant regression on TPC-DS between releases 1.1 and 1.2. 
For example, q95 went from ~165s to ~315s on TPC-DS 30TB and it looks like hash 
build/hash probe operations are the ones which have regressed the most (see 
images)
   ![spark-3 5-gluten-1 
2-rc1](https://github.com/user-attachments/assets/fc2e577e-83b5-4feb-8001-2c90e78929b7)
   ![spark3 3-gluten-1 
1](https://github.com/user-attachments/assets/a5266bee-8ee9-4829-802c-87cecce15281)
   
   Other queries with similar regressions caused by hash operations are 
q23a(240s to 440s), q23b (similar to q23a), q47(85s to 101s)
   
   I was able to reproduce this issue on both spark 3.4 and spark 3.5 - this is 
not a query plan change issue. I believe the regression happens in Velox but 
I'm not sure how to isolate it.
   
   ### Spark version
   
   Spark-3.4.x
   
   ### Spark configurations
   
   _No response_
   
   ### System information
   
   _No response_
   
   ### Relevant logs
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to