On Wed, Sep 30, 2015 at 6:16 AM, Tung Chou <[email protected]> wrote: > Hi Trevor, > > Sandy2x takes 156076 Haswell cycles for X25519 shared-secret > computation.
Thanks, noted! https://docs.google.com/spreadsheets/d/1SO3NGX-EgIZ1slw9uExb5FoeFy5TVkuA2lEutP6roYI/edit#gid=0 > Note that, > however, the non-vectorized implementation from the Ed25519 > paper performs much better on Haswell than on Ivy Bridge: > 161648 cycles versus 182708 cycles. Yeah, 156K vs 162 Kcycles is only a small improvement on the 2011 numbers. (There was debate earlier about how recent Haswell 25519 implementations change the FourQ:25519 speedup ratio. The answer seems to be not much - maybe the speedup is 2.65x instead of 2.75x). I'm curious why the 25519 implementations in above spreadsheet compare better with (Hamburg's 448, Gueron's P-256) on Sandy Bridge than Haswell, if anyone knows. Trevor _______________________________________________ Curves mailing list [email protected] https://moderncrypto.org/mailman/listinfo/curves
