Re: [RFC] propgation leap over memory copy for struct

Jeff Law via Gcc-patches Mon, 31 Oct 2022 15:13:56 -0700


On 10/30/22 20:42, Jiufu Guo via Gcc-patches wrote:

Hi,

We know that for struct variable assignment, memory copy may be used.
And for memcpy, we may load and store more bytes as possible at one time.
While it may be not best here:
1. Before/after stuct variable assignment, the vaiable may be operated.
And it is hard for some optimizations to leap over memcpy.  Then some struct
operations may be sub-optimimal.  Like the issue in PR65421.
2. The size of struct is constant mostly, the memcpy would be expanded.  Using
small size to load/store and executing in parallel may not slower than using
large size to loat/store. (sure, more registers may be used for smaller bytes.)


In PR65421, For source code as below:
////////t.c
#define FN 4
typedef struct { double a[FN]; } A;

A foo (const A *a) { return *a; }
A bar (const A a) { return a; }

So the first question in my mind is can we do better at the gimplephase? For the second case in particular can't we just "return a"rather than copying a into <retval> then returning <retval>? This feelsa lot like the return value optimization from C++. I'm not sure if itapplies to the first case or not, it's been a long time since I lookedat NRV optimizations, but it might be worth poking around in there a bit(tree-nrv.cc).

But even so, these kinds of things are still bound to happen, so it'sprobably worth thinking about if we can do better in RTL as well.

The first thing that comes to my mind is to annotate memcpy calls thatare structure assignments. The idea here is that we may want to expanda memcpy differently in those cases. Changing how we expand an opaquememcpy call is unlikely to be beneficial in most cases. But changinghow we expand a structure copy may be beneficial by exposing theunderlying field values. This would roughly correspond to your method #1.

Or instead of changing how we expand, teach the optimizers about theseannotated memcpy calls -- they're just a a copy of each field. That'show CSE and the propagators could treat them. After some point we'dlower them in the usual ways, but at least early in the RTL pipeline wecould keep them as annotated memcpy calls. This roughly corresponds toyour second suggestion.



jeff

Re: [RFC] propgation leap over memory copy for struct

Reply via email to