Re: auto vectorization notes

Crayo List via Digitalmars-d-learn Fri, 27 Mar 2020 22:26:09 -0700

On Monday, 23 March 2020 at 18:52:16 UTC, Bruce Carneal wrote:

When speeds are equivalent, or very close, I usually preferauto vectorized code to explicit SIMD/__vector code as it'seasier to read. (on the downside you have to guard againstcompiler code-gen performance regressions)
One oddity I've noticed is that I sometimes need to usepragma(inline, *false*) in order to get ldc to "do the rightthing". Apparently the compiler sees the costs/benefitsdifferently in the standalone context.
More widely known techniques that have gotten people over theserial/SIMD hump include:
 1) simplified indexing relationships
 2) known count inner loops (chunkify)
3) static foreach blocks (manual inlining that the compiler"gathers")
I'd be interested to hear from others regarding their autovectorization and __vector experiences. What has worked andwhat hasn't worked in your performance sensitive dlang code?

auto vectorization is bad because you never know if your codewill get vectorized next time you make some change to it andrecompile.

Just use : https://ispc.github.io/

Re: auto vectorization notes

Reply via email to