alamb commented on issue #5230:
URL: 
https://github.com/apache/arrow-datafusion/issues/5230#issuecomment-1435052501

   Here are my performance results (I did not dig into this yet). Basically it 
shows no speedup or slow downs
   
   Methodology:
   ```shell
   git checkout sort-preserve-row-encoding
   git cherry-pick 322e92bea6e28ada9f8d57d9429748fb58b2a2a5
   cargo bench -p datafusion --bench sort -- --save-baseline 
sort-preserve-row-encoding
   ```
   
   Results
   ```
   critcmp main sort-preserve-row-encoding
   group                                                     main               
                     sort-preserve-row-encoding
   -----                                                     ----               
                     --------------------------
   sort f64                                                  1.02  
655.8±233.27µs        ? ?/sec     1.00   640.9±24.78µs        ? ?/sec
   sort f64 preserve partitioning                            1.00      
5.1±0.09ms        ? ?/sec     1.02      5.2±0.10ms        ? ?/sec
   sort i64                                                  1.05  
599.4±420.69µs        ? ?/sec     1.00   571.7±11.11µs        ? ?/sec
   sort i64 preserve partitioning                            1.00      
4.5±0.08ms        ? ?/sec     1.02      4.6±0.09ms        ? ?/sec
   sort mixed tuple                                          1.00   
597.4±27.16µs        ? ?/sec     2.30  1376.1±69.26µs        ? ?/sec
   sort mixed tuple preserve partitioning                    1.00      
4.9±0.13ms        ? ?/sec     2.30     11.3±0.39ms        ? ?/sec
   sort mixed utf8 dictionary tuple                          1.00  
640.5±356.39µs        ? ?/sec     1.39   892.1±43.52µs        ? ?/sec
   sort mixed utf8 dictionary tuple preserve partitioning    1.00      
5.1±0.12ms        ? ?/sec     1.42      7.3±0.15ms        ? ?/sec
   sort utf8 dictionary                                      1.00    
200.7±4.89µs        ? ?/sec     1.00    200.4±4.35µs        ? ?/sec
   sort utf8 dictionary preserve partitioning                1.00  
1767.3±218.98µs        ? ?/sec    1.00  1758.6±96.26µs        ? ?/sec
   sort utf8 dictionary tuple                                1.00  
683.6±1094.18µs        ? ?/sec    1.24  846.7±107.74µs        ? ?/sec
   sort utf8 dictionary tuple preserve partitioning          1.00      
4.8±0.24ms        ? ?/sec     1.33      6.4±0.19ms        ? ?/sec
   sort utf8 high cardinality                                1.00      
2.1±0.04ms        ? ?/sec     1.01      2.1±0.06ms        ? ?/sec
   sort utf8 high cardinality preserve partitioning          1.01     
17.0±0.34ms        ? ?/sec     1.00     16.9±0.39ms        ? ?/sec
   sort utf8 low cardinality                                 1.00  
1258.3±84.71µs        ? ?/sec     1.02  1280.3±30.47µs        ? ?/sec
   sort utf8 low cardinality preserve partitioning           1.00  
1264.9±119.28µs        ? ?/sec    1.01  1278.6±52.69µs        ? ?/sec
   sort utf8 tuple                                           1.00  
1127.2±41.81µs        ? ?/sec     1.53  1723.5±59.90µs        ? ?/sec
   sort utf8 tuple preserve partitioning                     1.00      
9.4±0.26ms        ? ?/sec     1.53     14.4±0.44ms        ? ?/sec
   alamb@aal-dev:~/arrow-datafusion$
   ```
   
   I plan to next run a prof run to see where time is being spent


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to