Re: Optimizing salsa20

2020-07-22 Thread Niels Möller
Michael Weiser writes: > sorry for the delay - I've been on vacation. No problem. If you can test and debug arm big-endian, that's apprecated. > We still have the ARM BE CI ready to go. Is it maybe time to get it > activated on GitLab? I've put it in an MR for reference >

Re: Optimizing salsa20

2020-07-21 Thread Michael Weiser
Hello Niels, sorry for the delay - I've been on vacation. On Thu, Jul 09, 2020 at 04:05:21PM +0200, Niels Möller wrote: > This will break support for big-endian ARM for > now, since I'm not able to test that. We still have the ARM BE CI ready to go. Is it maybe time to get it activated on

Re: Optimizing salsa20

2020-07-10 Thread Niels Möller
ni...@lysator.liu.se (Niels Möller) writes: > I'd like to do the same for x86_64. I've now tried the same interleaving for salsa20 on x86_64, and it gives a 25% speedup on my laptop. Pushed to a new branch, x86_64-salsa20-2core. Regards, /Niels -- Niels Möller. PGP-encrypted email is

Re: Optimizing salsa20

2020-07-09 Thread Niels Möller
ni...@lysator.liu.se (Niels Möller) writes: > It interleaves the processing of two blocks, which gives a speedup of > 50% -- 100% on the ARM cores where I've tested it. Before merging, I > need to fix fat builds to use the new code on processors that support > it. I've added the fat build

Re: Optimizing salsa20

2020-07-09 Thread Maamoun TK
I would like to help but I have no clue or experience with ARM NEON, sorry. regards, Mamone On Tue, Jul 7, 2020 at 5:46 PM Niels Möller wrote: > I've written some new ARM Neon assembly for salsa20. See > > https://gitlab.com/gnutls/nettle/-/commit/2ac58a1ce729a6cfe1d3703f4deb6da8862909e9 > , >

Optimizing salsa20

2020-07-07 Thread Niels Möller
I've written some new ARM Neon assembly for salsa20. See https://gitlab.com/gnutls/nettle/-/commit/2ac58a1ce729a6cfe1d3703f4deb6da8862909e9, when configured with --enable-arm-neon. It interleaves the processing of two blocks, which gives a speedup of 50% -- 100% on the ARM cores where I've tested