Hello Richard,
On 14 Feb 11:26, Richard Biener wrote:
>
> The following tries to account for the fact that when constructing
> AVX256 or AVX512 vectors from elements we can only use insertps to
> insert into the low 128bits of a vector but have to use
> vinserti128 or vinserti64x4 to build larger
Richard,
I've benchmarked your patch on Skylake with SPEC CPU 20[06|17][fp|int]rate
and other smaller benchmark suites. It doesn't regress any benchmark
beyond noise, and it improves 525.x264 by 1.8%, 526.blender by 1.9% and
465.tonto by 3.2%.
I think this is a good reason to merge the