Note that Apple has a similar library called vForce: <http://developer.apple.com/ReleaseNotes/Performance/RN-vecLib/index.html > <http://developer.apple.com/documentation/Performance/Conceptual/vecLib/Reference/reference.html >
I think these libraries use several techniques and are not necessarily dependent on SSE. The apple versions appear to only support float and double (no complex), and I don't see anything about strided arrays. At one point I thought there was talk of adding support for vForce into the respective ufuncs. I don't know if anybody followed up on that. On 2009-01-16, at 10:48, Francesc Alted wrote: > Wow, pretty nice speed-ups indeed! In fact I was thinking in > including > support for threading in Numexpr (I don't think it would be too > difficult, but let's see). BTW, do you know how VML is able to > achieve > a speedup of 6x for a sin() function? I suppose this is because they > are using SSE instructions, but, are these also available for 64-bit > double precision items? _______________________________________________ Numpy-discussion mailing list Numpy-discussion@scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion