Michael Weiser writes:
> sorry for the delay - I've been on vacation.
No problem. If you can test and debug arm big-endian, that's apprecated.
> We still have the ARM BE CI ready to go. Is it maybe time to get it
> activated on GitLab? I've put it in an MR for reference
>
Hello Niels,
sorry for the delay - I've been on vacation.
On Thu, Jul 09, 2020 at 04:05:21PM +0200, Niels Möller wrote:
> This will break support for big-endian ARM for
> now, since I'm not able to test that.
We still have the ARM BE CI ready to go. Is it maybe time to get it
activated on
ni...@lysator.liu.se (Niels Möller) writes:
> I'd like to do the same for x86_64.
I've now tried the same interleaving for salsa20 on x86_64, and it gives a
25% speedup on my laptop. Pushed to a new branch, x86_64-salsa20-2core.
Regards,
/Niels
--
Niels Möller. PGP-encrypted email is
ni...@lysator.liu.se (Niels Möller) writes:
> It interleaves the processing of two blocks, which gives a speedup of
> 50% -- 100% on the ARM cores where I've tested it. Before merging, I
> need to fix fat builds to use the new code on processors that support
> it.
I've added the fat build
I would like to help but I have no clue or experience with ARM NEON, sorry.
regards,
Mamone
On Tue, Jul 7, 2020 at 5:46 PM Niels Möller wrote:
> I've written some new ARM Neon assembly for salsa20. See
>
> https://gitlab.com/gnutls/nettle/-/commit/2ac58a1ce729a6cfe1d3703f4deb6da8862909e9
> ,
>
I've written some new ARM Neon assembly for salsa20. See
https://gitlab.com/gnutls/nettle/-/commit/2ac58a1ce729a6cfe1d3703f4deb6da8862909e9,
when configured with --enable-arm-neon.
It interleaves the processing of two blocks, which gives a speedup of
50% -- 100% on the ARM cores where I've tested