[GitHub] [arrow-rs] tustvold commented on pull request #2038: Implement FromIterator for Builders

GitBox Fri, 15 Jul 2022 15:40:26 -0700


tustvold commented on PR #2038:
URL: https://github.com/apache/arrow-rs/pull/2038#issuecomment-1186002441


   Unfortunately this still represents a non-trivial performance regression for 
strings...
   
   ```
   array_string_from_vec 128                                                    
                         
                           time:   [3.0274 us 3.0277 us 3.0279 us]
                           change: [+49.381% +49.449% +49.526%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 12 outliers among 100 measurements (12.00%)
     1 (1.00%) low mild
     7 (7.00%) high mild
     4 (4.00%) high severe
   
   array_string_from_vec 256                                                    
                         
                           time:   [4.2068 us 4.2092 us 4.2123 us]
                           change: [+37.016% +37.296% +37.749%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 5 outliers among 100 measurements (5.00%)
     2 (2.00%) high mild
     3 (3.00%) high severe
   
   array_string_from_vec 512                                                    
                         
                           time:   [6.6659 us 6.6680 us 6.6700 us]
                           change: [+34.300% +34.358% +34.416%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 5 outliers among 100 measurements (5.00%)
     4 (4.00%) high mild
     1 (1.00%) high severe
   
   struct_array_from_vec 128                                                    
                         
                           time:   [4.2799 us 4.2823 us 4.2844 us]
                           change: [+38.535% +38.605% +38.688%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 1 outliers among 100 measurements (1.00%)
     1 (1.00%) high mild
   
   struct_array_from_vec 256                                                    
                         
                           time:   [6.2571 us 6.2585 us 6.2599 us]
                           change: [+40.854% +40.924% +40.986%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 3 outliers among 100 measurements (3.00%)
     1 (1.00%) low mild
     1 (1.00%) high mild
     1 (1.00%) high severe
   
   struct_array_from_vec 512                                                    
                         
                           time:   [9.6797 us 9.6819 us 9.6845 us]
                           change: [+35.304% +35.364% +35.419%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 5 outliers among 100 measurements (5.00%)
     3 (3.00%) high mild
     2 (2.00%) high severe
   
   struct_array_from_vec 1024                                                   
                          
                           time:   [16.488 us 16.493 us 16.500 us]
                           change: [+33.968% +34.039% +34.116%] (p = 0.00 < 
0.05)
                           Performance has regressed.
   Found 10 outliers among 100 measurements (10.00%)
     5 (5.00%) high mild
     5 (5.00%) high severe
   ```
   
   I wonder if you might like to try using something like 
[hotspot](https://github.com/KDAB/hotspot) or 
[cargo-flamegraph](https://github.com/flamegraph-rs/flamegraph) to see where 
the additional slowdown is coming from? I can try to take a look, but I'm a 
little bit swamped at the moment so not sure when I'll have time to investigate 
this


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [arrow-rs] tustvold commented on pull request #2038: Implement FromIterator for Builders

Reply via email to