> This change allows use of the AVX512_VBMI/VMBI2 instruction set to further > optimize decompression/parsing of polynomial coefficients for ML-KEM. The > speedup gained in the ML-KEM benchmarks for key generation is between 0.3 to > 0.6%, encapsulation is 0.4 to 1.7%, and decapsulation is 0.1 to 0.9%. > > Thank you to @sviswa7 and @ferakocz for their help in working through the > early stages of this code with me.
Shawn M Emery has updated the pull request incrementally with one additional commit since the last revision: 8360934: Add AVX-512 intrinsics for ML-KEM - enhancement on AVX512_VBMI Change Swap to Dup named function/variable Check for only VBMI support (not VBMI2) ------------- Changes: - all: https://git.openjdk.org/jdk/pull/28815/files - new: https://git.openjdk.org/jdk/pull/28815/files/7cd8de53..4af75963 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=28815&range=02 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=28815&range=01-02 Stats: 5 lines in 1 file changed: 0 ins; 0 del; 5 mod Patch: https://git.openjdk.org/jdk/pull/28815.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/28815/head:pull/28815 PR: https://git.openjdk.org/jdk/pull/28815
