[Bug target/95661] Code built with -m32 uses SSE2 instructions

2020-06-13 Thread memmerto at ca dot ibm.com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95661 --- Comment #4 from Matt Emmerton --- Thank you everyone. This fully explains why we were still getting SSE in 32-bit mode.

[Bug c/95661] New: Code built with -m32 uses SSE2 instructions

2020-06-12 Thread memmerto at ca dot ibm.com
Assignee: unassigned at gcc dot gnu.org Reporter: memmerto at ca dot ibm.com Target Milestone: --- When building 32-bit code with -m32, SSE2 instructions are generated. This is in contrast to the docs. https://gcc.gnu.org/onlinedocs/gcc/x86-Options.html "The -m32 option

[Bug target/93177] PPC: Missing many useful platform intrinsics

2020-01-24 Thread memmerto at ca dot ibm.com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93177 --- Comment #14 from Matt Emmerton --- I'd like to thank everyone for the great discussion so far. Here's a summary of where we are at this point. 1) sync intrinsics Useful, but with caveats. 2) cache prefetch intrinsics Implemented via __bui

[Bug target/93417] New: PPC: Support the "Flag Output Operands" so inline-asm can avoid having to copy CRx to GPR

2020-01-24 Thread memmerto at ca dot ibm.com
NCONFIRMED Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: memmerto at ca dot ibm.com Target Milestone: --- From https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93177 > If PowerPC back-end supported the

[Bug target/93408] New: PPC: Provide intrinsics for cache prefetch instructions

2020-01-23 Thread memmerto at ca dot ibm.com
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: memmerto at ca dot ibm.com Target Milestone: --- From https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93177 > > > > __protected_stream_set() > > > >

[Bug target/93177] PPC: Missing many useful platform intrinsics

2020-01-23 Thread memmerto at ca dot ibm.com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93177 --- Comment #11 from Matt Emmerton --- > > > > The implementation of stwcx() and stdcx() need revision on PPC. > > > > As I understand it, there is no need the mfocrf instruction nor the > > > > mask-and-shift on result. > > > > > > How else wou

[Bug target/93177] PPC: Missing many useful platform intrinsics

2020-01-13 Thread memmerto at ca dot ibm.com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93177 --- Comment #9 from Matt Emmerton --- (In reply to Segher Boessenkool from comment #6) > (In reply to Matt Emmerton from comment #4) > > The intrinsics that we would find useful, having used them as provided by > > the IBM XL C/C++ compiler, are

[Bug target/93177] PPC: Missing many useful platform intrinsics

2020-01-13 Thread memmerto at ca dot ibm.com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93177 --- Comment #8 from Matt Emmerton --- (In reply to Andrew Pinski from comment #5) > > __lwarx() > > __ldarx() > > __stwcx() > > __stdcx() > > Is there a reason why the __atomic_* builtins don't work? There are places in our code where we do man

[Bug target/93177] PPC: Missing many useful platform intrinsics

2020-01-10 Thread memmerto at ca dot ibm.com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93177 --- Comment #4 from Matt Emmerton --- The intrinsics that we would find useful, having used them as provided by the IBM XL C/C++ compiler, are the following: __sync() __isync() __lwsync() __dcbt() __dcbtst() __lwarx() __ldarx() __stwcx() __std

[Bug target/93177] PPC: Missing many useful platform intrinsics

2020-01-08 Thread memmerto at ca dot ibm.com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93177 --- Comment #2 from Matt Emmerton --- This appears to have packaging complications by vendors as well :( On powerpc-ibm-aix7.1.0.0 this doesn't get installed. On ppc64le-redhat-linux it does. However, both of these cases would benefit from some

[Bug target/93177] New: PPC: Missing many useful platform intrinsics

2020-01-06 Thread memmerto at ca dot ibm.com
: target Assignee: unassigned at gcc dot gnu.org Reporter: memmerto at ca dot ibm.com Target Milestone: --- File gcc/config/rs6000/ppu_intrinsics.h defines a lot of useful intrisics for PPC, but this heading on this file indicates that it is specific to the "Cell BEA",