>>> More coming in. Attached are ARMv8 modules. It might be worth noting that small-block performance of all presented Poly1305 modules can be improved by postponing pre-computations for vector code. I mean provided that a) non-vector initialization procedure is totally inexpensive, while b) vector initialization takes equivalent of several blocks; you can win by avoiding latter in cases when you have to process small amount of blocks, even if they will be processed by slower non-vector procedure. This can be easily arranged.
chacha-armv8.pl
Description: Perl program
poly1305-armv8.pl
Description: Perl program
_______________________________________________ openssl-dev mailing list To unsubscribe: https://mta.openssl.org/mailman/listinfo/openssl-dev
