dmatth1 commented on PR #10011:
URL: https://github.com/apache/arrow-rs/pull/10011#issuecomment-4526028953
Great callout. I ran the benchmark, and the numbers with autovectorization are
better:
Same-host, same-session medians (Cascade Lake-class Xeon @ 2.8 GHz), via the
public `Sbbf::{check,insert}` API (XXH64 + probe), criterion default profile,
ns/op:
| Regime    | Path   | Scalar | Autovec (this) | Hand-written AVX2 | Autovec vs scalar | Autovec vs hand-written |
|-----------|--------|-------:|---------------:|------------------:|------------------:|------------------------:|
| S 128 KiB | miss   | 13.02  | **4.96**       | 5.14              | 2.62×             | +4%                     |
| S 128 KiB | hit    | 13.47  | **4.95**       | 5.20              | 2.72×             | +5%                     |
| S 128 KiB | insert | 11.62  | **5.41**       | 5.38              | 2.15×             | tied                    |
| M 2 MiB   | miss   | 18.88  | **7.47**       | 8.18              | 2.53×             | +9%                     |
| M 2 MiB   | hit    | 18.12  | **7.22**       | 8.01              | 2.51×             | +11%                    |
| M 2 MiB   | insert | 14.99  | **8.45**       | 8.59              | 1.77×             | tied                    |
| L 32 MiB  | miss   | 27.56  | **11.07**      | 13.47             | 2.49×             | +18%                    |
| L 32 MiB  | hit    | 26.57  | **11.23**      | 13.40             | 2.37×             | +16%                    |
| L 32 MiB  | insert | 23.53  | **12.77**      | 12.60             | 1.84×             | tied                    |
Changes here:
https://github.com/apache/arrow-rs/compare/main...dmatth1:arrow-rs:sbbf-autovec-tf
This also unlocks NEON on aarch64, so I will bundle those numbers into the next
revision.
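
For context on what "autovec" refers to in the table, here is a minimal sketch of a split-block Bloom filter block probe written so that the compiler can vectorize it without intrinsics. This is not the code from the branch linked above; it is an illustration under stated assumptions. The salt constants come from the Parquet Bloom filter specification, while `Block`, `ToySbbf`, `insert_hash`, and `check_hash` are hypothetical names, and the sketch takes an already-computed 64-bit hash rather than running XXH64.

```rust
/// Salt constants from the Parquet split-block Bloom filter specification.
const SALT: [u32; 8] = [
    0x47b6137b, 0x44974d91, 0x8824ad5b, 0xa2b7289d,
    0x705495c7, 0x2df1424b, 0x9efc4947, 0x5c6bfb31,
];

/// One 256-bit block: eight 32-bit words. Each key sets one bit per word.
/// (Hypothetical type for this sketch, aligned to a 32-byte vector width.)
#[derive(Clone, Copy, Default)]
#[repr(align(32))]
pub struct Block([u32; 8]);

/// Build the eight-word mask for a key: one bit per word, chosen by the
/// top five bits of `x * SALT[i]`. Fixed trip count, no branches.
#[inline]
fn mask(x: u32) -> [u32; 8] {
    let mut m = [0u32; 8];
    for i in 0..8 {
        m[i] = 1u32 << (x.wrapping_mul(SALT[i]) >> 27);
    }
    m
}

impl Block {
    #[inline]
    pub fn insert(&mut self, x: u32) {
        let m = mask(x);
        for i in 0..8 {
            self.0[i] |= m[i];
        }
    }

    /// Branch-free membership test: accumulate any mask bit that is absent
    /// from the block instead of returning early on the first miss.
    #[inline]
    pub fn check(&self, x: u32) -> bool {
        let m = mask(x);
        let mut missing = 0u32;
        for i in 0..8 {
            missing |= m[i] & !self.0[i];
        }
        missing == 0
    }
}

/// Toy filter over pre-hashed keys (hypothetical; hashing is left out).
pub struct ToySbbf {
    blocks: Vec<Block>,
}

impl ToySbbf {
    pub fn new(num_blocks: usize) -> Self {
        assert!(num_blocks > 0);
        Self { blocks: vec![Block::default(); num_blocks] }
    }

    /// Upper 32 bits of the hash pick the block via multiply-shift.
    #[inline]
    fn block_index(&self, hash: u64) -> usize {
        (((hash >> 32) * self.blocks.len() as u64) >> 32) as usize
    }

    /// Lower 32 bits of the hash are the key probed inside the block.
    pub fn insert_hash(&mut self, hash: u64) {
        let i = self.block_index(hash);
        self.blocks[i].insert(hash as u32);
    }

    pub fn check_hash(&self, hash: u64) -> bool {
        self.blocks[self.block_index(hash)].check(hash as u32)
    }
}
```

The fixed-length loops over eight `u32` lanes with no early exit are the kind of shape a compiler can typically lower to a single 256-bit operation on AVX2 or a pair of 128-bit operations on NEON. Whether that actually happens depends on the target features and optimization level, so it is worth confirming in the generated assembly.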