Thank you for your work.

On POWER9 I got the following benchmark results:

./configure:
chacha      encrypt  308.58
chacha      decrypt  325.87
./configure --enable-power-altivec (master branch):
chacha      encrypt  342.15
chacha      decrypt  356.24
./configure --enable-power-altivec (ppc-chacha-2core branch):
chacha      encrypt  648.97
chacha      decrypt  648.00
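
For a quick comparison, the relative speedups implied by these figures can be worked out with a small script. This is only a sketch: the dictionary keys are labels made up for illustration, and the numbers are copied as-is from the results above, in whatever unit nettle-benchmark reports.

```python
# Throughput figures copied from the benchmark results above.
# The keys are illustrative labels, not names used by nettle.
results = {
    "plain": {"encrypt": 308.58, "decrypt": 325.87},
    "altivec-master": {"encrypt": 342.15, "decrypt": 356.24},
    "altivec-2core": {"encrypt": 648.97, "decrypt": 648.00},
}


def speedup(new, old):
    """Return the per-mode throughput ratio new/old."""
    return {mode: new[mode] / old[mode] for mode in new}


# 2-core branch compared with the master branch (both with altivec enabled).
two_core_vs_master = speedup(results["altivec-2core"], results["altivec-master"])

# 2-core branch compared with the plain configure build.
two_core_vs_plain = speedup(results["altivec-2core"], results["plain"])

for mode in ("encrypt", "decrypt"):
    print(
        f"{mode}: {two_core_vs_master[mode]:.2f}x over master, "
        f"{two_core_vs_plain[mode]:.2f}x over plain build"
    )
```

With these numbers the 2-core code comes out at roughly 1.8x to 1.9x the master branch, and about 2x the plain build.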

It has gotten faster with every further optimization of the core. Great work.

regards,
Mamone

On Mon, Nov 23, 2020 at 6:50 PM Niels Möller <[email protected]> wrote:

> Niels Möller <[email protected]> writes:
>
> >> It could likely be sped up further by processing 2, 3 or 4 blocks in
> >> parallel.
> >
> > I've given 2 blocks in parallel a try, but not quite working yet. My
> > work-in-progress code below.
>
> I've got it into working shape now, at least for little-endian. See
>
> https://git.lysator.liu.se/nettle/nettle/-/blob/ppc-chacha-2core/powerpc64/p7/chacha-2core.asm
>
> Next steps:
>
> 1. Fix it to work also for big-endian.
>
> 2. Wire it up for fat builds.
>
> 3. Try out if 4-way gives additional speedup.
>
> Benchmarking is appreciated. Compare the master branch to the
> ppc-chacha-2core branch, configured with --enable-power-altivec, and run
> ./examples/nettle-benchmark chacha.
>
> Regards,
> /Niels
>
> --
> Niels Möller. PGP-encrypted email is preferred. Keyid 368C6677.
> Internet email is subject to wholesale government surveillance.
> _______________________________________________
> nettle-bugs mailing list
> [email protected]
> http://lists.lysator.liu.se/mailman/listinfo/nettle-bugs
>