From: Joe Ramsay <[email protected]>
Date: Thursday, 28 May 2020 at 16:19
To: Gcc-patches <[email protected]>
Subject: [PATCH]: aarch64: add support for unpacked EOR, ORR and AND
Hi!
This patch improves code generation for EOR, ORR and AND on unpacked vectors
with SVE. The following function:
void f (unsigned int *x, unsigned short *y, unsigned short *z) {
  for (int i = 0; i < 7; ++i)
    x[i] = (unsigned short) (y[i] & z[i]);
}
previously compiled to:
ptrue p1.d, vl3
ld1h z0.d, p1/z, [x1, #1, mul vl]
ptrue p0.b, vl32
st1h z0.d, p0, [sp, #1, mul vl]
ld1h z0.d, p1/z, [x2, #1, mul vl]
st1h z0.d, p0, [sp]
ldr x3, [x2]
ldp x4, x2, [sp]
ldr x1, [x1]
and x1, x3, x1
and x2, x2, x4
str x2, [sp]
ld1h z0.d, p0/z, [sp]
str x1, [sp]
uxth z0.s, p0/m, z0.s
st1w z0.d, p1, [x0, #1, mul vl]
ld1h z0.d, p0/z, [sp]
uxth z0.s, p0/m, z0.s
st1w z0.d, p0, [x0]
add sp, sp, 16
ret
and now compiles to:
ptrue p0.s, vl7
ptrue p1.b, vl32
ld1h z1.s, p0/z, [x1]
ld1h z0.s, p0/z, [x2]
and z0.d, z0.d, z1.d
uxth z0.s, p1/m, z0.s
st1w z0.s, p0, [x0]
ret
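To illustrate why a bitwise operation can be applied directly to unpacked
vectors, here is a minimal scalar sketch in C. It is only a model, not code
from the patch: the helper names (widen, and_unpacked, and_reference,
unpacked_matches_reference) and the junk parameter are made up for the
example. Each 16-bit element sits in the low half of a 32-bit container, as
with "ld1h z0.s" above. Because AND, ORR and EOR work bit by bit, whatever is
in the upper half of a container cannot affect the low half, so operating on
whole containers and then zero-extending (the uxth step) gives the same
result as the original loop.

```c
#include <assert.h>
#include <stdint.h>

#define LANES 7

/* Model one unpacked lane: a 16-bit element in the low half of a 32-bit
   container.  JUNK stands in for arbitrary upper-half contents; it is an
   assumption of this sketch, used to show the upper bits do not matter.  */
static uint32_t widen (uint16_t elt, uint16_t junk)
{
  return (uint32_t) elt | ((uint32_t) junk << 16);
}

/* AND whole 32-bit containers, then keep only the low halfword, which
   models the trailing uxth.  */
static void and_unpacked (uint32_t *x, const uint16_t *y, const uint16_t *z,
                          uint16_t junk)
{
  for (int i = 0; i < LANES; ++i)
    {
      uint32_t vy = widen (y[i], junk);
      uint32_t vz = widen (z[i], (uint16_t) ~junk);
      x[i] = (vy & vz) & 0xffffu;
    }
}

/* Scalar reference: the loop from the patch description.  */
static void and_reference (uint32_t *x, const uint16_t *y, const uint16_t *z)
{
  for (int i = 0; i < LANES; ++i)
    x[i] = (uint16_t) (y[i] & z[i]);
}

/* Return 1 if the container-wide AND matches the reference for every lane
   and for several choices of upper-half junk, 0 otherwise.  */
static int unpacked_matches_reference (void)
{
  static const uint16_t y[LANES]
    = { 0xffff, 0x00ff, 0x1234, 0x0000, 0xaaaa, 0x5555, 0x8001 };
  static const uint16_t z[LANES]
    = { 0x0f0f, 0xff00, 0xffff, 0xffff, 0x5555, 0x5555, 0x8000 };
  static const uint16_t junk[] = { 0x0000, 0xffff, 0xdead, 0x1234 };
  uint32_t expected[LANES], got[LANES];

  and_reference (expected, y, z);
  for (unsigned int j = 0; j < sizeof junk / sizeof junk[0]; ++j)
    {
      and_unpacked (got, y, z, junk[j]);
      for (int i = 0; i < LANES; ++i)
        if (got[i] != expected[i])
          return 0;
    }
  return 1;
}
```

Calling unpacked_matches_reference () from a test driver is enough to exercise
the model. The same argument would not hold for an operation such as ADD,
where carries cross bit positions.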
Tested on aarch64-linux-gnu and x86_64-linux-gnu hosts.
Thanks,
Joe
gcc/ChangeLog

2020-05-20  Joe Ramsay  <[email protected]>

	* config/aarch64/aarch64-sve.md (<LOGICAL:optab><mode>3):
	Add support for unpacked EOR, ORR, AND.

gcc/testsuite/ChangeLog

2020-05-20  Joe Ramsay  <[email protected]>

	* gcc.target/aarch64/sve/load_const_offset_2.c: Force using packed
	vectors.
	* gcc.target/aarch64/sve/logical_unpacked_and_1.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_and_2.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_and_3.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_and_4.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_and_5.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_and_6.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_and_7.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_eor_1.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_eor_2.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_eor_3.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_eor_4.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_eor_5.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_eor_6.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_eor_7.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_orr_1.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_orr_2.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_orr_3.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_orr_4.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_orr_5.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_orr_6.c: New test.
	* gcc.target/aarch64/sve/logical_unpacked_orr_7.c: New test.
	* gcc.target/aarch64/sve/scatter_store_6.c: Force using packed vectors.
	* gcc.target/aarch64/sve/scatter_store_7.c: Force using packed vectors.
	* gcc.target/aarch64/sve/strided_load_3.c: Force using packed vectors.
	* gcc.target/aarch64/sve/strided_store_3.c: Force using packed vectors.
	* gcc.target/aarch64/sve/unpack_signed_1.c: Force using packed vectors.
Description: 0001-Support-AND-ORR-EOR-on-unpacked-vectors.patch
