On Sat, Jun 20, 2020 at 11:54 AM Niels Möller <ni...@lysator.liu.se> wrote:

> Have you measured speedup when going from 4 to 8 blocks? We shouldn't
> add larger loops than needed.
>

The 8x loop has x~1.15 performance boost over 4x loop, if you think it's
not worth it, I can add only 4x loop to make the code simpler.


> Do you measure a speedup from this? Karatsuba usually pays off only for
> a bit larger sizes (but I guess overhead is a little less here than for
> standard multiplication).
>

Actually, I considered the Karatsuba algorithm not only for performance but
to reduce the number of registers used. However, I believe that using the
Karatsuba algorithm in my case performs better or similar to classical
multiplication.


> >    - Since the functionality of gcm_set_key() is replaced with
> >    gcm_init_key() for PowerPC64LE, two warnings will pop up:
> [‘gcm_gf_shift’
> >    defined but not used] and [‘gcm_gf_add’ defined but not used]
>

  When I applied the patch to the last upstream, these warnings did not
appear,        some changes have occurred to gcm.c, I will look at it.


> To test PPC code, I wonder if it's easy to add a PPC build to
> .gitlab-ci, in the same way as arm and mips tests. These are based on
> Debian packaged cross compilers and qemu-user. I'm also not that familiar
> with the variants within the Power and PowerPC family of processors.
>

I will see what I can do.
_______________________________________________
nettle-bugs mailing list
nettle-bugs@lists.lysator.liu.se
http://lists.lysator.liu.se/mailman/listinfo/nettle-bugs

Reply via email to