Re: RFC: Allow "(mem: (reg:))"

Andrew Stubbs Thu, 19 Mar 2026 03:23:50 -0700

On 19/03/2026 08:25, Richard Biener wrote:

That said - the *scatter* optabs assume naive vectorization of the
first loop works, even when b[] = { 0, 1, 2, 0 }, so if GCN is not able
to guarantee this their "vector address store" are not scatters in
terms of what GCC assumes.  The documentation for the optabs
does not mention this constraint.

The primary use-case for the GCN port is OpenMP/OpenACC in which loopiterations are considered to be "independent" and therefore all suchconsiderations can be ignored. Not only is vectorization in play, butalso two levels of threading, so there is absolutely no guarantee whatorder operations happen. If the user writes code that is not, in fact,"independent" then that's on them.

There have indeed been a few occasions where GCC has refused to optimizebecause it would not preserve "correctness" even though all hope of thatcorrectness have already gone.

We "fixed" the floating-point reduction case by implementing "fold_left"optabs that actually do not strictly fold left, albeit only when-fopenmp is active. Consequently, the result of floating-point vectorreductions is stable, but it's not the same stable you'd get from theunvectorized loop. (The result of the outer OpenMP reduction loop, as awhole, is unstable, because the threads complete out of order.)Basically -fopenmp implies -fassociative-math, in this case.


If necessary, we'd do the same thing for scatter_store.

Andrew

Re: RFC: Allow "(mem: (reg:))"

Reply via email to