pitrou commented on issue #42951:
URL: https://github.com/apache/arrow/issues/42951#issuecomment-3323605827

   For reference, the current benchmark results on my machine (AMD Zen 2 CPU, 
Ubuntu 24.04):
   ```
   BM_PlainDecodingSpacedFloat/32768/1                                          
      3347 ns         3347 ns       198613 bytes_per_second=36.4711Gi/s 
null_percent=0.01
   BM_PlainDecodingSpacedFloat/32768/100                                        
      6131 ns         6130 ns       113455 bytes_per_second=19.9137Gi/s 
null_percent=1
   BM_PlainDecodingSpacedFloat/32768/1000                                       
     29780 ns        29776 ns        22998 bytes_per_second=4.09963Gi/s 
null_percent=10
   BM_PlainDecodingSpacedFloat/32768/5000                                       
     79986 ns        79971 ns         8705 bytes_per_second=1.52643Gi/s 
null_percent=50
   BM_PlainDecodingSpacedFloat/32768/10000                                      
       981 ns          981 ns       713115 bytes_per_second=124.382Gi/s 
null_percent=100
   
   BM_PlainDecodingSpacedDouble/32768/1                                         
      6623 ns         6622 ns       107008 bytes_per_second=36.8692Gi/s 
null_percent=0.01
   BM_PlainDecodingSpacedDouble/32768/100                                       
     10482 ns        10480 ns        68795 bytes_per_second=23.2951Gi/s 
null_percent=1
   BM_PlainDecodingSpacedDouble/32768/1000                                      
     33411 ns        33405 ns        20686 bytes_per_second=7.30841Gi/s 
null_percent=10
   BM_PlainDecodingSpacedDouble/32768/5000                                      
     86465 ns        86449 ns         8093 bytes_per_second=2.8241Gi/s 
null_percent=50
   BM_PlainDecodingSpacedDouble/32768/10000                                     
      2135 ns         2135 ns       327316 bytes_per_second=114.361Gi/s 
null_percent=100
   ```
   
   We see that when the null proportion is close to 0% or 100%, performance is 
excellent. It's still relatively good when null proportion is in the middle.
   (also, this randomly-generated data is really a worst case because nulls are 
distributed evenly across the data, which might not be the case for real-world 
data)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to