[gcc r15-7984] aarch64: Tighten pr110625_1.c regexp

Richard Sandiford via Gcc-cvs Wed, 12 Mar 2025 02:40:54 -0700

https://gcc.gnu.org/g:5cef719596400a712fc72dc54dd4ec8cdb694dd5


commit r15-7984-g5cef719596400a712fc72dc54dd4ec8cdb694dd5
Author: Richard Sandiford <[email protected]>
Date:   Wed Mar 12 09:40:10 2025 +0000

    aarch64: Tighten pr110625_1.c regexp
    
    Before r14-2877-gbf67bf4880ce5be0, the aarch64 code assumed that
    every multi-vector reduction would use single def-use cycles.
    The patch fixed it to test what the vectoriser actually planned
    to do, using newly provided information.
    
    At the time, we didn't try to use single def-use cycles for any costed
    variant in the associated testcase (gcc.target/aarch64/pr110625_1.c),
    so it was enough to check that the single-def-use latency was never
    printed to the dump file.  However, we do now consider using single
    def-use cycles for the single-lane SLP fallback.
    
    This patch therefore switches to a positive test of the
    non-single-def-use latency.  I checked that the test still failed
    in this form before r14-2877-gbf67bf4880ce5be0.
    
    gcc/testsuite/
            * gcc.target/aarch64/pr110625_1.c: Turn into a positive test for
            a vector latency of 2, rather than a negative test for a vector
            latency of 8.

Diff:
---
 gcc/testsuite/gcc.target/aarch64/pr110625_1.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/gcc/testsuite/gcc.target/aarch64/pr110625_1.c 
b/gcc/testsuite/gcc.target/aarch64/pr110625_1.c
index 0965cac33a00..1d0033c98333 100644
--- a/gcc/testsuite/gcc.target/aarch64/pr110625_1.c
+++ b/gcc/testsuite/gcc.target/aarch64/pr110625_1.c
@@ -1,6 +1,6 @@
 /* { dg-do compile } */
 /* { dg-options "-Ofast -mcpu=neoverse-n2 -fdump-tree-vect-details 
-fno-tree-slp-vectorize" } */
-/* { dg-final { scan-tree-dump-not "reduction latency = 8" "vect" } } */
+/* { dg-final { scan-tree-dump {Vector issue estimate:(?:(?!Cost 
model).)*reduction latency = 2\n} "vect" } } */
 
 /* Do not increase the vector body cost due to the incorrect reduction latency
     Original vector body cost = 51

[gcc r15-7984] aarch64: Tighten pr110625_1.c regexp

Reply via email to