[GitHub] [arrow-rs] jhorstmann commented on pull request #4560: Restructure `sum` for better auto-vectorization

via GitHub Sat, 22 Jul 2023 08:18:08 -0700


jhorstmann commented on PR #4560:
URL: https://github.com/apache/arrow-rs/pull/4560#issuecomment-1646605414


   Interesting, I can reproduce the benchmark results on my machine (i9-11900KB 
with AVX-512), the version without simd feature is even slightly faster. Very 
nice improvement!
   
   With `target-cpu=skylake` there is still a small difference for the nullable 
version, 70ns vs 56ns with simd feature. The packed_simd code is not taking 
full advantage of avx512 mask registers and therefore runs at the same speed 
when targeting either skylake or native.
   
   What cpu did you run your benchmarks on?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [arrow-rs] jhorstmann commented on pull request #4560: Restructure `sum` for better auto-vectorization

Reply via email to