On 19/5/22 16:50, [email protected] wrote:
> Hello,
> I am working on speeding up NumPy with the AArch64 SVE instruction set.
> I could not find a numpy implementation for SVE. Is there already a test
> implementation or discussion about SVE support?
OpenBLAS apparently has SVE support as of 0.3.20 [0], and the update to
that version was merged into NumPy yesterday.
So far we only have support for NEON and ASIMD in NumPy; see [1] for a
description of how we use intrinsics. Contributions to improve the
implementations and the documentation would be welcome. We currently use
Travis CI to run our aarch64 tests; access to more advanced machines
would also be good. Do you know of commercially available machines with
SVE or SVE2 support?
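
For anyone evaluating a candidate machine, here is a minimal sketch of one
way to check for SVE or SVE2. It assumes a Linux host whose kernel lists
CPU features in /proc/cpuinfo (a "Features" line on aarch64, "flags" on
x86). The helper names parse_cpu_features and has_sve are hypothetical and
are not part of NumPy.

```python
from pathlib import Path


def parse_cpu_features(cpuinfo_text):
    """Collect the feature flags listed in /proc/cpuinfo-style text."""
    features = set()
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        # Assumption: aarch64 kernels label the line "Features",
        # x86 kernels label it "flags".
        if sep and key.strip().lower() in ("features", "flags"):
            features.update(value.split())
    return features


def has_sve(cpuinfo_text):
    """Report whether the 'sve' and 'sve2' flags are present."""
    features = parse_cpu_features(cpuinfo_text)
    return {"sve": "sve" in features, "sve2": "sve2" in features}


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")
    if cpuinfo.exists():
        print(has_sve(cpuinfo.read_text()))
    else:
        print("no /proc/cpuinfo on this platform")
```

This only reports what the kernel exposes; it says nothing about which
SIMD code paths a given NumPy build was compiled with.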
Matti
[0] https://github.com/xianyi/OpenBLAS/blob/faf58d2b3ffb20fd334cab080700be564ef74ec2/Changelog.txt#L28
[1] https://numpy.org/devdocs/reference/simd/build-options.html