They are meant to be optimized. Any contribution to improve them further is 
more than welcome. 

Raghuveer

-----Original Message-----
From: Noah Goldstein <goldstein....@gmail.com> 
Sent: Thursday, November 4, 2021 10:46 AM
To: numpy-discussion@python.org
Subject: [Numpy-discussion] [RFC] - numpy/SVML appears to be poorly optimized

The numpy SVML library: https://github.com/numpy/SVML

appears to be poorly optimized. Since its just the raw assembly dump this also 
makes it quite difficult to improve (with either a better compiler or by hand).

Some of the glaring issues are:
1. register allocation / spilling
2. rodata layouts / const-propagation of the values.
3. Very odd use of internal functions that really ought to be inlined.

Are these functions meant to be heavily optimized?

If so, are people open to patches that optimize them (either with new C 
implementations are in the current assembly implementations).
_______________________________________________
NumPy-Discussion mailing list -- numpy-discussion@python.org To unsubscribe 
send an email to numpy-discussion-le...@python.org 
https://mail.python.org/mailman3/lists/numpy-discussion.python.org/
Member address: raghuveer.devulapa...@intel.com
_______________________________________________
NumPy-Discussion mailing list -- numpy-discussion@python.org
To unsubscribe send an email to numpy-discussion-le...@python.org
https://mail.python.org/mailman3/lists/numpy-discussion.python.org/
Member address: arch...@mail-archive.com

Reply via email to