Re: [PR] feat: Implement native version of ColumnarToRow [datafusion-comet]

via GitHub Wed, 06 Nov 2024 09:52:06 -0800


parthchandra commented on PR #1034:
URL: https://github.com/apache/datafusion-comet/pull/1034#issuecomment-2460421886


   @andygrove I tried your change and got:
   ```
   Running benchmark: ColumnarToRowExec
     Running case: Spark Columnar To Row - integer
     Stopped after 35 iterations, 2037 ms
     Running case: Comet Columnar To Row - integer
     Stopped after 15 iterations, 2022 ms
   
   OpenJDK 64-Bit Server VM 11.0.19+7-LTS on Mac OS X 14.6
   Apple M3 Max
   ColumnarToRowExec:                        Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
   ------------------------------------------------------------------------------------------------------------------------
   Spark Columnar To Row - integer                      42             58          13        249.5           4.0       1.0X
   Comet Columnar To Row - integer                      52            135          36        200.9           5.0       0.8X
   ```
   So it is not really any better. I also tried another change that precomputes the row lengths and row start addresses to accommodate variable-length fields, and then looks up the row lengths while iterating over the rows for each column.
   That was slower still, at `0.7x` of Spark.
   I don't think this is going to work, so I'm leaving this closed.
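
For readers unfamiliar with the second approach mentioned above, here is a minimal sketch of the general idea: one pass computes each row's byte length and start offset, and a second pass fills the row buffer column by column. It is not the code from this PR. The `Column` enum, the 8-byte slot layout, and the function names `row_layout` and `columnar_to_rows` are all hypothetical, and nulls are ignored to keep it short.

```rust
/// Hypothetical column type for this sketch (not Comet's or Arrow's real types).
enum Column {
    Int(Vec<i32>),
    Str(Vec<String>),
}

/// Assumed layout: one 8-byte slot per field, then variable-length data
/// padded to a multiple of 8 bytes. No null bitmap in this sketch.
const WORD: usize = 8;

fn round_up(n: usize) -> usize {
    (n + WORD - 1) / WORD * WORD
}

/// Pass 1: per-row byte lengths and per-row start offsets (exclusive prefix sum).
fn row_layout(columns: &[Column], num_rows: usize) -> (Vec<usize>, Vec<usize>) {
    let fixed = columns.len() * WORD;
    let mut lengths = vec![fixed; num_rows];
    for col in columns {
        if let Column::Str(values) = col {
            for (row, v) in values.iter().enumerate() {
                lengths[row] += round_up(v.len());
            }
        }
    }
    let mut starts = Vec::with_capacity(num_rows);
    let mut offset = 0;
    for len in &lengths {
        starts.push(offset);
        offset += len;
    }
    (lengths, starts)
}

/// Pass 2: iterate column by column, writing each value into its row
/// at the precomputed start offset. Returns (buffer, lengths, starts).
fn columnar_to_rows(columns: &[Column], num_rows: usize) -> (Vec<u8>, Vec<usize>, Vec<usize>) {
    let (lengths, starts) = row_layout(columns, num_rows);
    let total: usize = lengths.iter().sum();
    let mut buf = vec![0u8; total];
    // Next free byte in each row's variable-length region, relative to the row start.
    let mut var_cursor = vec![columns.len() * WORD; num_rows];
    for (c, col) in columns.iter().enumerate() {
        let slot = c * WORD;
        match col {
            Column::Int(values) => {
                for (row, v) in values.iter().enumerate() {
                    let at = starts[row] + slot;
                    buf[at..at + 4].copy_from_slice(&v.to_le_bytes());
                }
            }
            Column::Str(values) => {
                for (row, v) in values.iter().enumerate() {
                    let rel = var_cursor[row];
                    // The slot stores (relative offset << 32) | byte length.
                    let word = ((rel as u64) << 32) | v.len() as u64;
                    let at = starts[row] + slot;
                    buf[at..at + WORD].copy_from_slice(&word.to_le_bytes());
                    let data = starts[row] + rel;
                    buf[data..data + v.len()].copy_from_slice(v.as_bytes());
                    var_cursor[row] = rel + round_up(v.len());
                }
            }
        }
    }
    (buf, lengths, starts)
}
```

Calling `columnar_to_rows(&cols, n)` on a batch of `n` rows returns the packed buffer together with the per-row lengths and start offsets. Note that the column-major write loop touches every row's memory once per column and needs the extra `starts` and `var_cursor` lookups, which may be part of why this kind of approach did not help in the benchmark above; that is a guess, not something measured here.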


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


