So much like the changes to add_synthesis, this adjusts xor/ior
synthesis to use riscv_integer_cost rather than riscv_const_insns. For
those that didn't read the add_synthesis patch, what happens is
riscv_const_insns returns 0 for constants requiring more than 3 insns to
synthesize. So imagine if the original constant had cost 5, it's bit
inversion has cost 4. Both get converted to "0" because they're over
the maximal value and thus we can't distinguish between them and we fail
to use C' with XNOR, ORN or ANDN to improve the resulting code.
The constants here were actually from the AND cases, but given the
common ISA capability and GCC structure I suspected the AND cases would
apply to IOR/XOR, and they do.
Tested without regression on riscv32-elf and riscv64-elf, also
bootstrapped and regression tested on the k3 and c920. I'll obviously
be waiting for pre-commit CI to do its thing before moving forward.
Jeff
gcc/
* config/riscv/riscv.cc (synthesize_ior_xor): Use riscv_integer_cost
rather than riscv_const_insns.
gcc/testsuite
* gcc.target/riscv/xor-synthesis-4.c: New test.
diff --git a/gcc/config/riscv/riscv.cc b/gcc/config/riscv/riscv.cc
index d5eab5421318..74cbca077147 100644
--- a/gcc/config/riscv/riscv.cc
+++ b/gcc/config/riscv/riscv.cc
@@ -15687,7 +15687,7 @@ synthesize_ior_xor (rtx_code code, rtx operands[3])
execution and fusion in the constant synthesis those would naturally
decrease the budget. It also does not account for the IOR/XOR at
the end of the sequence which would increase the budget. */
- int budget = (TARGET_ZBS ? riscv_const_insns (operands[2], true) : -1);
+ int budget = (TARGET_ZBS ? riscv_integer_cost (INTVAL (operands[2]), true) :
-1);
int original_budget = budget;
/* Bits we need to set in operands[0]. As we synthesize the operation,
@@ -15751,7 +15751,7 @@ synthesize_ior_xor (rtx_code code, rtx operands[3])
if ((TARGET_ZBB || TARGET_XTHEADBB || TARGET_ZBKB)
&& budget < 0
&& popcount_hwi (INTVAL (operands[2])) <= 11
- && riscv_const_insns (operands[2], true) >= 3)
+ && riscv_integer_cost (INTVAL (operands[2]), true) >= 3)
{
ival = INTVAL (operands[2]);
/* First see if the constant trivially fits into 11 bits in the LSB. */
@@ -15840,9 +15840,9 @@ synthesize_ior_xor (rtx_code code, rtx operands[3])
we have Zbb, then we have XNOR and ORN. So if the inverted constant
is cheaper, invert it and use XNOR/ORN. */
if (TARGET_ZBB
- && riscv_const_insns (GEN_INT (~UINTVAL (operands[2])), true) > 0
- && (riscv_const_insns (operands[2], true)
- > riscv_const_insns (GEN_INT (~UINTVAL (operands[2])), true)))
+ && riscv_integer_cost (~UINTVAL (operands[2]), true) > 0
+ && (riscv_integer_cost (INTVAL (operands[2]), true)
+ > riscv_integer_cost (~UINTVAL (operands[2]), true)))
{
rtx x = force_reg (word_mode, GEN_INT (~UINTVAL (operands[2])));
diff --git a/gcc/testsuite/gcc.target/riscv/xor-synthesis-4.c
b/gcc/testsuite/gcc.target/riscv/xor-synthesis-4.c
new file mode 100644
index 000000000000..609a125a92f4
--- /dev/null
+++ b/gcc/testsuite/gcc.target/riscv/xor-synthesis-4.c
@@ -0,0 +1,19 @@
+/* { dg-do "compile" { target rv64 } } */
+/* { dg-options "-march=rv64gcb -mabi=lp64d" } */
+
+
+
+#define T(X) long xnor##X(long x) { return x ^ X; }
+
+T(0x000200c3233fffffUL)
+T(0x300000000003ffffUL)
+T(0x00004f10000a0fffUL)
+T(0x0000c7f3801fefffUL)
+T(0xe00000000000fc00UL)
+T(0x0eef7ffffffbbfffUL)
+T(0xff0000000000ff00UL)
+
+/* Each test above is better handled by inverting the constant
+ and using xnor. */
+
+/* { dg-final { scan-assembler-times "xnor\t" 7 } } */