On Sat, Jun 20, 2020 at 11:54 AM Niels Möller wrote:
> Have you measured speedup when going from 4 to 8 blocks? We shouldn't
> add larger loops than needed.
>
The 8x loop has x~1.15 performance boost over 4x loop, if you think it's
not worth it, I can add only 4x loop to make the code simpler.
Maamoun TK writes:
> I added a PowerPC64LE optimized version of AES and GHASH to nettle.
Cool. I haven't yet looked at the patches, but some general comments:
>- The main equation: The main equation for 4 block (128-bit each) can be
>seen in reference [1] Digest =
diff -urN nettle/configure.ac nettle_PowerPC64LE/configure.ac
--- nettle/configure.ac 2020-06-08 08:42:20.0 +0300
+++ nettle_PowerPC64LE/configure.ac 2020-06-15 18:41:43.485342900 +0300
@@ -435,6 +435,9 @@
esac
fi
;;
+*powerpc64le*)
+ asm_path=powerpc64le
+ ;;
On Thu, Jun 18, 2020 at 6:58 PM Maamoun TK
wrote:
> I added a PowerPC64LE optimized version of AES and GHASH to nettle.
> Patch summary:
>
> GHASH Algorithm
>
> I took the advantage of several references and researches to achieve the
> high-speed implementation of this algorithm. These
I added a PowerPC64LE optimized version of AES and GHASH to nettle.
Patch summary:
GHASH Algorithm
I took the advantage of several references and researches to achieve the
high-speed implementation of this algorithm. These references include
several techniques that have been used to improve the