Re: [PATCH] Improve AVX V4DF and V8SF permutations (PR target/52568)

2012-03-15 Thread Richard Henderson
On 03/15/12 13:09, Jakub Jelinek wrote: > Hi! > > As noted in the PR, we can vectorize e.g. V4DFmode > __builtin_shuffle (, {1, 2, 3, 0}) in 3 insns, some intra-lane > permutation, followed by swapping of the lanes (vperm2f128) and > finally vblend{pd,ps} that merges in the registers with non-swap

[PATCH] Improve AVX V4DF and V8SF permutations (PR target/52568)

2012-03-15 Thread Jakub Jelinek
Hi! As noted in the PR, we can vectorize e.g. V4DFmode __builtin_shuffle (, {1, 2, 3, 0}) in 3 insns, some intra-lane permutation, followed by swapping of the lanes (vperm2f128) and finally vblend{pd,ps} that merges in the registers with non-swapped and swapped lanes. Bootstrapped/regtested on x8