Fwd: PPC64LE optimizing AES and GHASH

2020-06-22 Thread Maamoun TK
On Sat, Jun 20, 2020 at 11:54 AM Niels Möller wrote: > Have you measured speedup when going from 4 to 8 blocks? We shouldn't > add larger loops than needed. > The 8x loop has x~1.15 performance boost over 4x loop, if you think it's not worth it, I can add only 4x loop to make the code simpler.

Re: PPC64LE optimizing AES and GHASH

2020-06-20 Thread Niels Möller
Maamoun TK writes: > I added a PowerPC64LE optimized version of AES and GHASH to nettle. Cool. I haven't yet looked at the patches, but some general comments: >- The main equation: The main equation for 4 block (128-bit each) can be >seen in reference [1] Digest =

Re: PPC64LE optimizing AES and GHASH

2020-06-20 Thread Maamoun TK
diff -urN nettle/configure.ac nettle_PowerPC64LE/configure.ac --- nettle/configure.ac 2020-06-08 08:42:20.0 +0300 +++ nettle_PowerPC64LE/configure.ac 2020-06-15 18:41:43.485342900 +0300 @@ -435,6 +435,9 @@ esac fi ;; +*powerpc64le*) + asm_path=powerpc64le + ;;

Re: PPC64LE optimizing AES and GHASH

2020-06-18 Thread Maamoun TK
On Thu, Jun 18, 2020 at 6:58 PM Maamoun TK wrote: > I added a PowerPC64LE optimized version of AES and GHASH to nettle. > Patch summary: > > GHASH Algorithm > > I took the advantage of several references and researches to achieve the > high-speed implementation of this algorithm. These

PPC64LE optimizing AES and GHASH

2020-06-18 Thread Maamoun TK
I added a PowerPC64LE optimized version of AES and GHASH to nettle. Patch summary: GHASH Algorithm I took the advantage of several references and researches to achieve the high-speed implementation of this algorithm. These references include several techniques that have been used to improve the