Hi Pavel,
Pavel Semjanov wrote:
Hello again,
as I promised, here is the optimized code for SHA-256 hash, x86
platform. Should work faster on Core 2/iX up to 20%. This code you are
free to use (or modify) in any form on OpenSSL and GRYPTOGAMS. I guess
you should make it PIC, as any other code for x86 (I didn't make it
because I don't need it in my projects).
FWIW:
I've grabbed this .pl file , downloaded openssl 1.0.0j and compared the
performance of 'openssl speed sha256' with and without the patch;
initially I found *NO* noticable performance difference on any of the
64bit machines I tested . Then it occurred to me that the patch was for
the 32bit version only (the file sha512-x86_64.pl also covers sha256); I
modified the 'Configure' script to allow the compilation of a 32bit
version of openssl *with* the assembly routines. The results for this
version are on various Intel CPUs
Core2 E6550 (Conroe): 22 - 32 % speed up
Xeon E5440 (Harpertown): 24 - 33% speed up
Xeon X5660 (Westmere-EP): 19 - 27% speed up
i5-560M (Arrandale): 18 - 23 % speed up
Note that for the i5-560M the unpatched 64bit version still outperforms
the patched 32bit version....
How can the sha256 patch be applied to the 64bit code base?
cheers,
JJK / Jan Just Keijser
______________________________________________________________________
OpenSSL Project http://www.openssl.org
Development Mailing List [email protected]
Automated List Manager [email protected]