Re: RFR: 8360934: Add AVX-512 intrinsics for ML-KEM - enhancement on AVX512_VBMI [v5]

duke Fri, 16 Jan 2026 22:29:34 -0800

On Sun, 11 Jan 2026 06:59:20 GMT, Shawn M Emery <[email protected]> wrote:


>> This change allows use of the AVX512_VBMI instruction set to further 
>> optimize decompression/parsing of polynomial coefficients for ML-KEM.  The 
>> speedup gained in the ML-KEM benchmarks for key generation is between 0.4 to 
>> 0.5%, encapsulation is  0.2 to 1.7%, and decapsulation is 0.3 to 2.0%.
>> 
>> Thank you to @sviswa7 and @ferakocz for their help in working through the 
>> early stages of this code with me.
>
> Shawn M Emery has updated the pull request incrementally with one additional 
> commit since the last revision:
> 
>   Update to use OptoLoopAlignment for VBMILoop

@smemery 
Your change (at version f278a63fff4a9f268803a1e2e5fbad260d29d11c) is now ready 
to be sponsored by a Committer.

-------------

PR Comment: https://git.openjdk.org/jdk/pull/28815#issuecomment-3762780881

Re: RFR: 8360934: Add AVX-512 intrinsics for ML-KEM - enhancement on AVX512_VBMI [v5]

Reply via email to