[I] [EPIC] Improve Comet planning [datafusion-comet]

via GitHub Mon, 20 Apr 2026 12:10:16 -0700


andygrove opened a new issue, #4005:
URL: https://github.com/apache/datafusion-comet/issues/4005


   ### What is the problem the feature request solves?
   
   Comet currently eagerly tries to convert each operator to a Comet operator. 
This sometimes leads to inefficient plans that can be slower than just falling 
back to Spark.
   
   One example is a Comet JVM shuffle inserted above a child plan that runs 
in Spark: the rows are converted to columnar format for the shuffle, and the 
next stage immediately converts them back to row-based format.
   
   ```
   +- HashAggregate
      +- CometNativeColumnarToRow
         +- CometColumnarExchange
            +- HashAggregate
   ```
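
A minimal sketch of how this pattern could be detected, assuming a toy plan tree rather than Spark's or Comet's real classes. `PlanNode`, `is_comet`, and `find_wasted_columnar_shuffles` are hypothetical names introduced only for illustration, and treating every operator whose name starts with `Comet` as a Comet operator is a simplifying assumption:

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class PlanNode:
    """Toy stand-in for a physical plan node (hypothetical, not a Spark class)."""
    name: str
    children: List["PlanNode"] = field(default_factory=list)


def is_comet(node: PlanNode) -> bool:
    # Assumption: Comet operators are recognizable by their name prefix.
    return node.name.startswith("Comet")


def find_wasted_columnar_shuffles(root: PlanNode) -> List[PlanNode]:
    """Collect CometColumnarExchange nodes whose input comes from Spark
    operators and whose output is converted straight back to rows for a
    Spark parent."""
    hits: List[PlanNode] = []

    def visit(node: PlanNode, parent: Optional[PlanNode]) -> None:
        if (
            node.name == "CometNativeColumnarToRow"
            and parent is not None
            and not is_comet(parent)
            and len(node.children) == 1
            and node.children[0].name == "CometColumnarExchange"
        ):
            exchange = node.children[0]
            # The exchange is only "wasted" if everything below it is Spark too.
            if exchange.children and not any(is_comet(c) for c in exchange.children):
                hits.append(exchange)
        for child in node.children:
            visit(child, node)

    visit(root, None)
    return hits


# The plan fragment shown above, rebuilt with the toy node type.
plan = PlanNode("HashAggregate", [
    PlanNode("CometNativeColumnarToRow", [
        PlanNode("CometColumnarExchange", [
            PlanNode("HashAggregate"),
        ]),
    ]),
])
wasted = find_wasted_columnar_shuffles(plan)
```

A planner could use a check like this to decide to fall back to a regular Spark shuffle for the flagged exchanges. Whether that is the right fix is part of what the linked document is meant to discuss.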
   
   Another recent example involved DPP (dynamic partition pruning) fallback; 
a fix was applied for that specific case.
   
   Document to discuss:
   
   
https://docs.google.com/document/d/1ux_SwZPd64VTtuTC9lC46oIZGKcoH6KhWOS8xlqwdx8/edit?usp=sharing
   
   ### Describe the potential solution
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

