Hello!
> This patch adds the support to two non-isomorphic operations addsub
> and subadd for SLP vectorizer. More non-isomorphic operations can be
> added later, but the limitation is that operations on even/odd
> elements should still be isomorphic. Once such an operation is
> detected, the code of the operation used in vectorized code is stored
> and later will be used during statement transformation. Two new GIMPLE
> opeartions VEC_ADDSUB_EXPR and VEC_SUBADD_EXPR are defined. And also
> new optabs for them. They are also documented.
>
> The target supports for SSE/SSE2/SSE3/AVX are added for those two new
> operations on floating points. SSE3/AVX provides ADDSUBPD and ADDSUBPS
> instructions. For SSE/SSE2, those two operations are emulated using
> two instructions (selectively negate then add).
;; SSE3
UNSPEC_LDDQU
+ UNSPEC_SUBADD
+ UNSPEC_ADDSUB
No! Please avoid unspecs.
+(define_expand "vec_subadd_v4sf3"
+ [(set (match_operand:V4SF 0 "register_operand")
+ (unspec:V4SF
+ [(match_operand:V4SF 1 "register_operand")
+ (match_operand:V4SF 2 "nonimmediate_operand")] UNSPEC_SUBADD))]
+ "TARGET_SSE"
+{
+ if (TARGET_SSE3)
+ emit_insn (gen_sse3_addsubv4sf3 (operands[0], operands[1], operands[2]));
+ else
+ ix86_sse_expand_fp_addsub_operator (true, V4SFmode, operands);
+ DONE;
+})
Make the expander pattern look like correspondig sse3 insn and:
...
{
if (!TARGET_SSE3)
{
ix86_sse_expand_fp_...();
DONE;
}
}
Uros.