https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98167

--- Comment #3 from Hongtao.liu <crazylht at gmail dot com> ---
;; _3 = __builtin_ia32_shufps (b_2(D), b_2(D), 0);

(insn 7 6 8 (set (reg:V4SF 88)
        (reg/v:V4SF 86 [ b ])) "./gcc/include/xmmintrin.h":746:19 -1
     (nil))

(insn 8 7 9 (set (reg:V4SF 89)
        (reg/v:V4SF 86 [ b ])) "./gcc/include/xmmintrin.h":746:19 -1
     (nil))

(insn 9 8 10 (set (reg:V4SF 87)
        (vec_select:V4SF (vec_concat:V8SF (reg:V4SF 88)
                (reg:V4SF 89))
            (parallel [
                    (const_int 0 [0]) repeated x2
                    (const_int 4 [0x4]) repeated x2
                ]))) "./gcc/include/xmmintrin.h":746:19 -1
     (nil))
;; _5 = __builtin_ia32_shufps (a_4(D), a_4(D), 0);

(insn 11 10 12 (set (reg:V4SF 91)
        (reg/v:V4SF 85 [ a ])) "./gcc/include/xmmintrin.h":746:19 -1
     (nil))

(insn 12 11 13 (set (reg:V4SF 92)
        (reg/v:V4SF 85 [ a ])) "./gcc/include/xmmintrin.h":746:19 -1
     (nil))

(insn 13 12 14 (set (reg:V4SF 90)
        (vec_select:V4SF (vec_concat:V8SF (reg:V4SF 91)
                (reg:V4SF 92))
            (parallel [
                    (const_int 0 [0]) repeated x2
                    (const_int 4 [0x4]) repeated x2
                ]))) "./gcc/include/xmmintrin.h":746:19 -1
     (nil))


Simplify each of the vec_select patterns above (shown here for reg 86) to

(vec_duplicate:V4SF
  (vec_select:SF (reg:V4SF 86)
                 (parallel [(const_int 0)])))

then add a combine splitter to transform
(mult (vec_duplicate op1) (vec_duplicate op2)) into
(vec_duplicate (mult op1 op2))?
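
To illustrate why both steps preserve the result, here is a minimal scalar sketch in C. It is a model of the RTL semantics, not GCC code: the v4sf struct and all function names are hypothetical, and it assumes the two shuffled values feed a multiply, as the proposed splitter implies.

```c
#include <assert.h>

/* Hypothetical scalar model of a 4 x float vector (V4SF in the dumps). */
typedef struct { float e[4]; } v4sf;

/* Models (vec_select:V4SF (vec_concat:V8SF lo hi) (parallel [sel...])):
   indices 0..3 pick from lo, indices 4..7 pick from hi. */
static v4sf select_from_concat(v4sf lo, v4sf hi, const int sel[4])
{
    v4sf r;
    for (int i = 0; i < 4; i++) {
        assert(sel[i] >= 0 && sel[i] < 8);
        r.e[i] = sel[i] < 4 ? lo.e[sel[i]] : hi.e[sel[i] - 4];
    }
    return r;
}

/* Models (vec_duplicate:V4SF (vec_select:SF x (parallel [(const_int 0)]))). */
static v4sf dup_elem0(v4sf x)
{
    v4sf r = {{ x.e[0], x.e[0], x.e[0], x.e[0] }};
    return r;
}

/* Element-wise multiply, standing in for (mult:V4SF a b). */
static v4sf mul(v4sf a, v4sf b)
{
    v4sf r;
    for (int i = 0; i < 4; i++)
        r.e[i] = a.e[i] * b.e[i];
    return r;
}

/* Form in the dumps: shuffle each input with {0, 0, 4, 4}, then multiply. */
static v4sf mul_of_dups(v4sf a, v4sf b)
{
    static const int sel[4] = { 0, 0, 4, 4 };
    return mul(select_from_concat(a, a, sel), select_from_concat(b, b, sel));
}

/* Proposed form: one scalar multiply of element 0, then broadcast. */
static v4sf dup_of_mul(v4sf a, v4sf b)
{
    float p = a.e[0] * b.e[0];
    v4sf r = {{ p, p, p, p }};
    return r;
}
```

Because both halves of the vec_concat are the same register, selector indices 0 and 4 read the same element, so select_from_concat(x, x, {0, 0, 4, 4}) matches dup_elem0(x) for any x. Every lane of mul_of_dups then computes the same product a[0] * b[0], which is what dup_of_mul broadcasts.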
