On December 9, 2017 10:50:12 AM GMT+01:00, Jakub Jelinek <ja...@redhat.com> wrote: >Hi! > >When POINTER_PLUS_EXPR is vectorized, we vectorize it as a vector >PLUS_EXPR. >This (usually? Not sure about targets where sizetype has different >precision from POINTER_SIZE; maybe those just don't have vector types) >works >because we vectorize pointer variables as vectors of unsigned pointer >sized >integers and if sizetype has the same precision, then it is the same >vector >too, so both operands and result are compatible vectors. > >POINTER_DIFF_EXPR is different, the arguments are pointers which get >vectype of vectors of unsigned pointer sized integers, but the result >is a corresponding signed type, so vectype_out is vector of signed >pointer >sized integers; those aren't compatible. > >So, we can't just vectorize POINTER_DIFF_EXPR as vector MINUS_EXPR, we >need >to VCE the result from vectype to vectype_out. > >Fixed thusly, bootstrapped/regtested on x86_64-linux and i686-linux, ok >for >trunk?
OK. Richard. >2017-12-09 Jakub Jelinek <ja...@redhat.com> > > PR tree-optimization/83338 > * tree-vect-stmts.c (vectorizable_operation): Handle POINTER_DIFF_EXPR > vectorization as MINUS_EXPR with a subsequent VIEW_CONVERT_EXPR from > vector of unsigned integers to vector of signed integers. > > * gcc.dg/vect/pr83338.c: New test. > >--- gcc/tree-vect-stmts.c.jj 2017-12-08 12:21:58.000000000 +0100 >+++ gcc/tree-vect-stmts.c 2017-12-09 00:55:17.614147824 +0100 >@@ -5226,7 +5226,7 @@ vectorizable_operation (gimple *stmt, gi > stmt_vec_info stmt_info = vinfo_for_stmt (stmt); > tree vectype; > loop_vec_info loop_vinfo = STMT_VINFO_LOOP_VINFO (stmt_info); >- enum tree_code code; >+ enum tree_code code, orig_code; > machine_mode vec_mode; > tree new_temp; > int op_type; >@@ -5264,7 +5264,7 @@ vectorizable_operation (gimple *stmt, gi > if (TREE_CODE (gimple_assign_lhs (stmt)) != SSA_NAME) > return false; > >- code = gimple_assign_rhs_code (stmt); >+ orig_code = code = gimple_assign_rhs_code (stmt); > > /* For pointer addition and subtraction, we should use the normal > plus and minus for the vector operation. */ >@@ -5455,6 +5455,14 @@ vectorizable_operation (gimple *stmt, gi > /* Handle def. */ > vec_dest = vect_create_destination_var (scalar_dest, vectype); > >+ /* POINTER_DIFF_EXPR has pointer arguments which are vectorized as >+ vectors with unsigned elements, but the result is signed. So, we >+ need to compute the MINUS_EXPR into vectype temporary and >+ VIEW_CONVERT_EXPR it into the final vectype_out result. */ >+ tree vec_cvt_dest = NULL_TREE; >+ if (orig_code == POINTER_DIFF_EXPR) >+ vec_cvt_dest = vect_create_destination_var (scalar_dest, >vectype_out); >+ > /* In case the vectorization factor (VF) is bigger than the number > of elements that we can fit in a vectype (nunits), we have to generate > more than one vector stmt - i.e - we need to "unroll" the >@@ -5546,6 +5554,15 @@ vectorizable_operation (gimple *stmt, gi > new_temp = make_ssa_name (vec_dest, new_stmt); > gimple_assign_set_lhs (new_stmt, new_temp); > vect_finish_stmt_generation (stmt, new_stmt, gsi); >+ if (vec_cvt_dest) >+ { >+ new_temp = build1 (VIEW_CONVERT_EXPR, vectype_out, new_temp); >+ new_stmt = gimple_build_assign (vec_cvt_dest, >VIEW_CONVERT_EXPR, >+ new_temp); >+ new_temp = make_ssa_name (vec_cvt_dest, new_stmt); >+ gimple_assign_set_lhs (new_stmt, new_temp); >+ vect_finish_stmt_generation (stmt, new_stmt, gsi); >+ } > if (slp_node) > SLP_TREE_VEC_STMTS (slp_node).quick_push (new_stmt); > } >--- gcc/testsuite/gcc.dg/vect/pr83338.c.jj 2017-12-09 >01:00:06.602565622 +0100 >+++ gcc/testsuite/gcc.dg/vect/pr83338.c 2017-12-09 01:00:18.297422116 >+0100 >@@ -0,0 +1,10 @@ >+/* PR tree-optimization/83338 */ >+/* { dg-do compile } */ >+ >+void >+foo (char **p, char **q, __PTRDIFF_TYPE__ *r) >+{ >+ int i; >+ for (i = 0; i < 1024; i++) >+ r[i] = p[i] - q[i]; >+} > > > Jakub