Re: [PATCH v1 1/2] RISC-V: Combine vec_duplicate + vmerge.vv to vmerge.vx on GR2VR cost

Robin Dapp Mon, 04 Aug 2025 00:27:59 -0700

@@ -3971,15 +3971,20 @@ get_vector_binary_rtx_cost (rtx x, intscalar2vr_cost)

   rtx op_0;
   rtx op_1;

- if (GET_CODE (x) == UNSPEC)

-    {
-      op_0 = XVECEXP (x, 0, 0);
-      op_1 = XVECEXP (x, 0, 1);
-    }
-  else
+  switch (GET_CODE (x))
     {
-      op_0 = XEXP (x, 0);
-      op_1 = XEXP (x, 1);
+      case REG:
+       return COSTS_N_INSNS (1);
+      case VEC_DUPLICATE:
+       return (scalar2vr_cost + 1) * COSTS_N_INSNS (1);
+      case UNSPEC:
+       op_0 = XVECEXP (x, 0, 0);
+       op_1 = XVECEXP (x, 0, 1);
+       break;
+      default:
+       op_0 = XEXP (x, 0);
+       op_1 = XEXP (x, 1);
+       break;

Generally, the patch looks reasonable to me but costing gets more confusing bythe day :)

Originally we used get_vector_binary_rtx_cost just for binary ops but now italso takes regs, vec_duplicates etc.? And merge is not really a binary opeither, is it?

The whole idea of rtx costing is to use the recursive properties and build a"cost tree". Generally, isn't it sufficient to find a (vec_duplicate: scalar)anywhere and then be done? The only distinction we need is whether we'recosting a real vmv.v.x or any other instruction.


As we're not recursively traversing, does something like

FOR_EACH_SUBRTX (...)
if (GET_CODE (*iter) == VEC_DUPLICATE)

help, regardless of the actual operation?

--
Regards
Robin

Re: [PATCH v1 1/2] RISC-V: Combine vec_duplicate + vmerge.vv to vmerge.vx on GR2VR cost

Reply via email to