[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-07-30 Thread marxin at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 Martin Liška changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-07-30 Thread cvs-commit at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 --- Comment #11 from CVS Commits --- The master branch has been updated by Martin Liska : https://gcc.gnu.org/g:dc65aba7a4725d1b464c8c64a5f739ee910e8943 commit r11-2422-gdc65aba7a4725d1b464c8c64a5f739ee910e8943 Author: Martin Liska Date:

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-07-30 Thread cvs-commit at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 --- Comment #12 from CVS Commits --- The releases/gcc-10 branch has been updated by Martin Liska : https://gcc.gnu.org/g:809b4d226c7f5ded392a88ffafe8d652f911b473 commit r10-8554-g809b4d226c7f5ded392a88ffafe8d652f911b473 Author: Martin Liska

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-06-01 Thread marxin at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 --- Comment #10 from Martin Liška --- And this is for znver2: AMD EPYC 7502P 32-Core Processor 64-bit: memcpy block size libcall rep1noalg rep4noalg rep8noalg loop noalg unrlnoalg sse noalg bytePGO

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-06-01 Thread ubizjak at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 --- Comment #9 from Uroš Bizjak --- (In reply to Alexander Monakov from comment #8) > There's no tuning tables for memcmp at all, existing structs cover only > memset and memcpy. So as far as I see retuning memset/memcpy doesn't need to > wait

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-06-01 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 --- Comment #8 from Alexander Monakov --- There's no tuning tables for memcmp at all, existing structs cover only memset and memcpy. So as far as I see retuning memset/memcpy doesn't need to wait for [1], because there's no infrastructure in

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-06-01 Thread ubizjak at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 Uroš Bizjak changed: What|Removed |Added CC||hjl.tools at gmail dot com --- Comment #7

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-06-01 Thread marxin at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 --- Comment #6 from Martin Liška --- All right, there are results for a micro-benchmark that we have in contrib folder: ZNVER1 (AMD Ryzen 7 2700X): 64-bit: contrib/bench-stringop 64 32000 gcc -march=znver1 memcpy block size libcall

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-06-01 Thread marxin at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 Martin Liška changed: What|Removed |Added CC||marxin at gcc dot gnu.org Last

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-05-30 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org ---

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-05-30 Thread jan at jki dot io
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 --- Comment #4 from Jan --- Sorry bad wording on my site. I meant the code is getting slower with znver2.

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-05-30 Thread glisse at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 Marc Glisse changed: What|Removed |Added Target||x86-*-* --- Comment #3 from Marc Glisse

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-05-30 Thread jan at jki dot io
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 --- Comment #2 from Jan --- Created attachment 48643 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=48643=edit source code

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-05-30 Thread jan at jki dot io
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 --- Comment #1 from Jan --- Created attachment 48642 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=48642=edit gcc -g -m32 -march=skylake -O1 -s testmem_modified.c -o tm32