dpengpeng commented on issue #9853:
URL: 
https://github.com/apache/incubator-gluten/issues/9853#issuecomment-2954862965

   > > [@FelixYBW](https://github.com/FelixYBW) Thank you for your response. My 
intuition is that in K8S mode, the performance of Spark tasks using local disks 
for shuffle is inferior to using Celeborn. Therefore, I speculate that the 
performance of Gluten using Celeborn will be further improved in K8S mode.
   > 
   > how much time perf boost when you switch from local disk to Celeborn using 
jvm spark? The features benefit jvm spark can also benefit Gluten.
   
   The first graph shows the performance improvement ratio of (Spark+RSS) 
compared to Spark. Two SQL queries show a decrease in performance.
   
   
![Image](https://github.com/user-attachments/assets/d688d802-664a-4276-9cda-725f45e1d3fd)
   
   
   The second graph shows the performance improvement ratio of (Gluen+RSS) 
compared to Gluten. Sixteen SQL queries show a decrease in performance.
   
   
![Image](https://github.com/user-attachments/assets/5f9ed0da-5e52-41b9-a4d4-51d952d83eab)
   
   
   From the above two graphs, it can be observed that the trend of Gluten 
differs from that of Spark. After using RSS, the performance of many SQL 
queries in Gluten has actually decreased. Could this be due to differences in 
the efficiency of operations on columnar shuffle data versus row-based shuffle 
data?
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to