On Mon, 17 Feb 2025 13:53:30 GMT, Ferenc Rakoczi <d...@openjdk.org> wrote:
> By using the aarch64 vector registers the speed of the computation of the > ML-KEM algorithms (key generation, encapsulation, decapsulation) can be > approximately doubled. ML-KEM benchmark results of this PR: MLKEM.decapsulate 512 11.80 us/op MLKEM.decapsulate 768 18.19 us/op MLKEM.decapsulate 1024 29.57 us/op MLKEM.encapsulate 512 8.80 us/op MLKEM.encapsulate 768 13.49 us/op MLKEM.encapsulate 1024 22.53 us/op MLKEM.keygen 512 7.49 us/op MLKEM.keygen 768 11.22 us/op MLKEM.keygen 1024 19.08 us/op ML-KEM no intrinsics MLKEM.decapsulate 512 31.23 us/op MLKEM.decapsulate 768 50.09 us/op MLKEM.decapsulate 1024 75.92 us/op MLKEM.encapsulate 512 22.72 us/op MLKEM.encapsulate 768 37.27 us/op MLKEM.encapsulate 1024 59.69 us/op MLKEM.keygen 512 17.95 us/op MLKEM.keygen 768 30.95 us/op MLKEM.keygen 1024 49.04 us/op ------------- PR Comment: https://git.openjdk.org/jdk/pull/23663#issuecomment-2683631601