Re: [PATCH][i386] Adjust vec_construct cost for AVX256/512, penaltize elementwise load vectorization

2018-02-20 Thread Kirill Yukhin
Hello Richard,

On 14 Feb 11:26, Richard Biener wrote:
> The following tries to account for the fact that when constructing
> AVX256 or AVX512 vectors from elements we can only use insertps to
> insert into the low 128bits of a vector but have to use
> vinserti128 or vinserti64x4 to build larger

RE: [PATCH][i386] Adjust vec_construct cost for AVX256/512, penaltize elementwise load vectorization

2018-02-15 Thread Shalnov, Sergey
Richard, I've benchmarked your patch on Skylake with SPEC CPU 20[06|17][fp|int]rate and other smaller benchmark suites. I found that it doesn't regress any benchmark beyond noise but improves 525.x264 by 1.8%, 526.blender by 1.9% and 465.tonto by 3.2%. I think this is a good reason to merge the