https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106565
--- Comment #2 from Quanhua Liu ---
I modified the application code (see below) and now use "method" as a control
variable read from the command line.
I use the same code for both gfortran 10.3.0 and ifort 19.0.5.281.
gfortran -O3 matrixCal.f90
time
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106565
--- Comment #5 from Quanhua Liu ---
Hi Richard,
Using -fexternal-blas with gfortran v10.3.0 is much slower than
method 2:
BB = transpose(B)
C = matmul(A, BB)
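
For readers without the original test program at hand, the two variants being
compared might look roughly like the sketch below. It is not the code attached
to the report: the program name, the matrix size n, the argument handling, and
in particular the assumption that "method 1" is the direct
matmul(A, transpose(B)) call are all illustrative guesses.

```fortran
program matmul_transpose_sketch
  implicit none
  integer, parameter :: dp = kind(1.0d0)
  integer, parameter :: n = 500          ! illustrative size, not from the report
  real(dp), allocatable :: A(:,:), B(:,:), BB(:,:), C(:,:)
  character(len=16) :: arg
  integer :: method, ios

  ! Read the control variable "method" from the command line (default 1).
  method = 1
  if (command_argument_count() >= 1) then
     call get_command_argument(1, arg)
     read(arg, *, iostat=ios) method
     if (ios /= 0) method = 1
  end if

  allocate(A(n,n), B(n,n), BB(n,n), C(n,n))
  call random_number(A)
  call random_number(B)

  select case (method)
  case (1)
     ! Assumed "method 1": the transpose is passed directly to matmul.
     C = matmul(A, transpose(B))
  case (2)
     ! "Method 2" from the comment: transpose into a temporary first.
     BB = transpose(B)
     C = matmul(A, BB)
  case default
     stop 'method must be 1 or 2'
  end select

  ! Print a checksum so the product cannot be optimized away.
  print *, 'method =', method, ' sum(C) =', sum(C)
end program matmul_transpose_sketch
```

Such a sketch could be built and timed the same way as in this thread, for
example with "gfortran -O3 sketch.f90" followed by "time ./a.out 1" and
"time ./a.out 2" (the file name sketch.f90 is hypothetical).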
How about on your machine?
Thanks,
Quanhua Liu
On 8/9/2022 11:07 AM,
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106565
--- Comment #4 from Quanhua Liu ---
Using
gfortran -O3 -fexternal-blas -L/. -lblas testmatrixCal.f90
time a.out 1
real: 6.14 (s)
time a.out 2
real: 5.41 (s)
It is 6 times slower than
BB = transpose(B)
C = matmul(A, BB)
ifort doesn't
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106565
--- Comment #9 from Quanhua Liu ---
Hi Richard,
It seems that I cannot add a comment to the ticket online.
I tried
gfortran -o z -O3 -march=native test_matrixCal.f90 -fexternal-blas -lblas -fdump-tree-optimized
time a.out 1
and
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106565
Bug ID: 106565
Summary: Using a transposed matrix in matmul (GCC-10.3.0) is
very slow
Product: gcc
Version: unknown
Status: UNCONFIRMED
Severity: normal