On Wed, Jul 21, 2021 at 9:38 PM Nicholai Tukanov <nicholaituka...@gmail.com>
wrote:

> I would like to understand how to go about extending the SIMD framework in
> order to add support for POWER10. Specifically, I would like to add the
> following instructions: `lxvp` and `stxvp` which loads/stores 256 bits
> into/from two vectors. I believe that this will be able to give a decent
> performance boost for those on POWER machines since it can halved the
> amount of loads/stores issued.
>

Thanks for proposing this Nicholai. Hopefully someone more knowledgeable
than me can point out how to go about this.


> Additionally, matrix engines (2-D SIMD instructions) are becoming quite
> popular due to their performance improvements for deep learning and
> scientific computing. Would it be beneficial to add these new advanced SIMD
> instructions into the framework or should these instructions be left to
> libraries such as OpenBLAS and MKL?
>

 This is indeed best left to OpenBLAS, MKL et al.

Cheers,
Ralf
_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion@python.org
https://mail.python.org/mailman/listinfo/numpy-discussion

Reply via email to