I had some local patches in my tree which adds a bswap tree code.
This breaks the aarch64 back-end vectorizing of byteswaps as we use
the standard mechanism to see if a tree code vectorizes (optabs).
Since it make sense to have consistent of the pattern names between
the vector version and the scalar version, I am proposing this patch
to make them consistent.
OK? Build and tested on aarch64-elf with no regressions.
Thanks,
Andrew Pinski
ChangeLog:
* config/aarch64/aarch64-simd-builtins.def (bswap): Use CF2 rather
than CF10 so 2 is appended on the code.
* config/aarch64/aarch64-simd.md (bswap<mode>): Rename to ...
(bswap<mode>2): This so it matches for the optabs.
Index: config/aarch64/aarch64-simd.md
===================================================================
--- config/aarch64/aarch64-simd.md (revision 218026)
+++ config/aarch64/aarch64-simd.md (working copy)
@@ -286,7 +286,7 @@ (define_insn "mul<mode>3"
[(set_attr "type" "neon_mul_<Vetype><q>")]
)
-(define_insn "bswap<mode>"
+(define_insn "bswap<mode>2"
[(set (match_operand:VDQHSD 0 "register_operand" "=w")
(bswap:VDQHSD (match_operand:VDQHSD 1 "register_operand" "w")))]
"TARGET_SIMD"
@@ -308,7 +308,7 @@ (define_expand "ctz<mode>2"
(ctz:VS (match_operand:VS 1 "register_operand")))]
"TARGET_SIMD"
{
- emit_insn (gen_bswap<mode> (operands[0], operands[1]));
+ emit_insn (gen_bswap<mode>2 (operands[0], operands[1]));
rtx op0_castsi2qi = simplify_gen_subreg(<VS:VSI2QI>mode, operands[0],
<MODE>mode, 0);
emit_insn (gen_aarch64_rbit<VS:vsi2qi> (op0_castsi2qi, op0_castsi2qi));
Index: config/aarch64/aarch64-simd-builtins.def
===================================================================
--- config/aarch64/aarch64-simd-builtins.def (revision 218026)
+++ config/aarch64/aarch64-simd-builtins.def (working copy)
@@ -317,7 +317,7 @@
VAR1 (UNOP, floatunsv4si, 2, v4sf)
VAR1 (UNOP, floatunsv2di, 2, v2df)
- VAR5 (UNOPU, bswap, 10, v4hi, v8hi, v2si, v4si, v2di)
+ VAR5 (UNOPU, bswap, 2, v4hi, v8hi, v2si, v4si, v2di)
BUILTIN_VB (UNOP, rbit, 0)