Benjamin Peterson <benja...@python.org> added the comment:

On Tue, Oct 24, 2017, at 00:25, Christian Heimes wrote:
>
> Christian Heimes <li...@cheimes.de> added the comment:
>
> I'm pretty sure that your PR has disabled all SSE optimizations. AFAIK
> gcc does not enable SSE3 and SSE4 on X86_64 by default.
>
> $ gcc -dM -E - < /dev/null | grep SSE
> #define __SSE2_MATH__ 1
> #define __SSE_MATH__ 1
> #define __SSE2__ 1
> #define __SSE__ 1

Before this patch, this would cause blake2b.c to use the slow SSE2-only
instructions, though, right? It seems to me this represents an
improvement or the status quo in all cases. With no extra GCC flags, the
reference implementation is used rather than a slow SSE2 implementation.
If extra -m flags are in CFLAGS, the fastest implementation for the
target is used.

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<https://bugs.python.org/issue31834>
_______________________________________
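
To illustrate the selection described above, here is a minimal sketch of compile-time dispatch on GCC's predefined macros. The function name blake2_impl_choice and the returned strings are hypothetical, and the exact set of macros checked is an assumption; this is not the code from the patch or from the BLAKE2 sources. The point is only that __SSE2__ alone falls through to the reference implementation, while macros enabled by extra -m flags select an SSE path.

```c
/* Hypothetical helper, for illustration only: not part of CPython or BLAKE2.
 * It reports which implementation a build would pick, assuming the choice is
 * driven purely by the compiler's predefined macros. */
const char *blake2_impl_choice(void)
{
#if defined(__XOP__)
    return "sse (XOP)";        /* e.g. built with -mxop */
#elif defined(__AVX__)
    return "sse (AVX)";        /* e.g. built with -mavx */
#elif defined(__SSE4_1__)
    return "sse (SSE4.1)";     /* e.g. built with -msse4.1 */
#elif defined(__SSSE3__)
    return "sse (SSSE3)";      /* e.g. built with -mssse3 */
#else
    /* Only __SSE__/__SSE2__ (or nothing) defined: use the portable code
     * rather than a slow SSE2-only path. */
    return "reference";
#endif
}
```

Built with plain gcc on x86-64, where only the SSE and SSE2 macros quoted above are defined, the function should return "reference"; adding -msse4.1 to CFLAGS should make it return "sse (SSE4.1)".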