[Bug target/70133] AArch64 -mtune=native generates improperly formatted -march parameters

2016-03-18 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70133 Andrew Roberts changed: What|Removed |Added CC||andrewm.roberts at sky dot com

[Bug bootstrap/67728] Build fails when cross-compiling with in-tree GMP and ISL

2016-03-13 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=67728 --- Comment #8 from Andrew Roberts --- The initial bug report was for cross compiling. Bug 70211 is for native builds on ARM. Given the huge growth in ARM development boards, this needs at least documenting. As with the original reporter I spent

[Bug bootstrap/67728] Build fails when cross-compiling with in-tree GMP and ISL

2016-03-14 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=67728 --- Comment #11 from Andrew Roberts --- On Native ARM platform the bootstrap does work with the old in tree GMP 4.3.2, regardless of wether you use none-linux-gnu or armv7l-linux-gnu when configuring GMP. Bulding by patching toplevel Makefile

[Bug bootstrap/70211] New: gcc-6-20160306 fails to build on ARM Linux with in tree ISL due to undefined GMP symbol __gmpn_invert_limb in isl_test

2016-03-12 Thread andrewm.roberts at sky dot com
Version: 6.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: bootstrap Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- Build of gcc-6-20160306 fails on ARM Linux when

[Bug c/70210] New: -march=native and -mcpu=native do not detect ARM cortex-a53 in 32 bit mode on Linux

2016-03-12 Thread andrewm.roberts at sky dot com
Severity: normal Priority: P3 Component: c Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- When using -march=native or -mcpu=native with gcc-6-20160306 snapshot (and also on previous released versions

[Bug c/70210] -march=native and -mcpu=native do not detect ARM cortex-a53 in 32 bit mode on Linux

2016-03-12 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70210 --- Comment #1 from Andrew Roberts --- Note this was causing bug 70132 (ARM -mcpu=native can cause a double free abort). A patch as been subitted to fix the double free, but doesn't address the failure to detect the CPU

[Bug driver/70132] ARM -mcpu=native can cause a double free abort.

2016-03-11 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70132 --- Comment #4 from Andrew Roberts --- Patch tested OK, on Raspberry Pi 3, on Arch Linux using latest gcc 6 snapshot: /usr/local/gcc-6.0.0/bin/gcc -v Using built-in specs. COLLECT_GCC=/usr/local/gcc-6.0.0/bin/gcc

[Bug driver/70132] ARM -mcpu=native can cause a double free abort.

2016-03-11 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70132 --- Comment #5 from Andrew Roberts --- Do I need to raise another bug report to get the march=native to actually generate native code, or has one already been raised? My original report (Bug 70136) included full /proc/cpuinfo for the BCM2834

[Bug bootstrap/67728] Build fails when cross-compiling with in-tree GMP and ISL

2016-03-16 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=67728 --- Comment #15 from Andrew Roberts --- Marc, not entirely clear what you mean by reproducing the issue without downloading mpfr, mpc, isl etc. Do you mean the missing symbol in GMP or the issues with GMP when using assembly code? If you could

[Bug bootstrap/70211] gcc-6-20160306 fails to build on ARM Linux with in tree ISL due to undefined GMP symbol __gmpn_invert_limb in isl_test

2016-03-13 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70211 --- Comment #1 from Andrew Roberts --- Looking at the toplevel Makefile.in the gmp targets (maybe-configure-gmp, configure-gmp etc) use: --build=${build_alias} --host=none-${host_vendor}-${host_os} --target=none-${host_vendor}-${host_os} where

[Bug bootstrap/67728] Build fails when cross-compiling with in-tree GMP and ISL

2016-03-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=67728 --- Comment #22 from Andrew Roberts --- Tested with: gcc-6-20160313 and in-tree: gmp-6.1.99-20160321 mpc-1.0.3 mpfr-3.1.4 isl-0.16.1 On: armv7l Arch Linux Arm (Raspberry Pi 3) (not bootstrapped yet due to build time) This also builds ok with

[Bug bootstrap/67728] Build fails when cross-compiling with in-tree GMP and ISL

2016-03-21 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=67728 --- Comment #21 from Andrew Roberts --- Tested with: gcc-6-20160313 and in-tree: gmp-6.1.99-20160321 mpc-1.0.3 mpfr-3.1.4 isl-0.16.1 On: x86_64 Centos 7 (Full bootstrap) This is Ok. /usr/local/gcc-6.0.0/bin/gcc -v Using built-in specs.

[Bug bootstrap/67728] Build fails when cross-compiling with in-tree GMP and ISL

2016-03-27 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=67728 --- Comment #25 from Andrew Roberts --- The patch works on native armv7l-unknown-linux-gnuabihf with: gcc-6-20160320 and in tree gmp 6.1.0 mpc 1.0.3 mpfr 3.1.4 isl 0.16.1 although I wasn't seeing a problem with check-mpc. At least the build

[Bug c/70136] New: -march=native causes SIGABRT due to double close of FILE on certain ARM systems (BCM2834, armv8 cortex-a53)

2016-03-08 Thread andrewm.roberts at sky dot com
Status: UNCONFIRMED Severity: normal Priority: P3 Component: c Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- gcc 4.9.1 (Raspbian Linux) gcc 5.3.0 (Arch Linux) gcc 6-20160306 (Arch

[Bug bootstrap/78471] New: gcc-7-20161120 and truck fail to build on armv7l with ICE in cp-demangle.c, earlier snapshots ok

2016-11-22 Thread andrewm.roberts at sky dot com
Severity: normal Priority: P3 Component: bootstrap Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- Created attachment 40110 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=40110=e

[Bug middle-end/81818] aarch64 uses 2-3x memory and 2x time of arm at -Os, -O2, -O3

2017-08-16 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81818 --- Comment #6 from Andrew Roberts --- Looks like this info got purged by the bugzilla failure, here it is again: Ok, I've done some more digging. Looking at the optimization options enabled by -O2 vs -O1, I built the test program at -O1 and

[Bug middle-end/81818] aarch64 uses 2-3x memory and 2x time of arm at -Os, -O2, -O3

2017-08-16 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81818 --- Comment #7 from Andrew Roberts --- I'll try the memory testing on both arm and aarch64. I've also tried -fopt-info-all-optall, I was hoping this would provide some info on what was happening, but it only seems to give any output under -O3.

[Bug middle-end/81818] aarch64 uses 2-3x memory and 2x time of arm at -Os, -O2, -O3

2017-08-16 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81818 --- Comment #8 from Andrew Roberts --- I've tried building gcc-8-20170806 and gcc-8-20170813 with --enable-gather-detailed-mem-stats This fails on x86-64, arm and aarch64 with the same error. The recently released 7.2.0 build ok on x86-64 at

[Bug bootstrap/81864] New: building gcc 8 with --enable-gather-detailed-mem-stats fails on x86-64, arm and aarch64 under gnu linux

2017-08-16 Thread andrewm.roberts at sky dot com
: UNCONFIRMED Severity: normal Priority: P3 Component: bootstrap Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- Building gcc gcc-8-20170806 with --enable-gather-detailed-mem-stats fails: On x64, arm

[Bug middle-end/81818] aarch64 uses 2-3x memory and 2x time of arm at -Os, -O2, -O3

2017-08-11 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81818 --- Comment #4 from Andrew Roberts --- Looking at --param ggc-min-expand and --param ggc-min-heapsize For gcc 8.0.0: on arm with 1Gb RAM: GGC heuristics: --param ggc-min-expand=93 --param ggc-min-heapsize=119808 on aarch64 with 1Gb RAM: GGC

[Bug c++/81818] aarch64 uses 2-3x memory and 2x time of arm at -Os, -O2, -O3 (memory-hog)

2017-08-11 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81818 --- Comment #2 from Andrew Roberts --- Created attachment 41975 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=41975=edit Full test results for aarch64 on Raspberry Pi3 Test results for -O0, -Os, -O1, -O2, -O3 for gcc 5.4.0, 6.4.0, 7.2.0,

[Bug c++/81818] aarch64 uses 2-3x memory and 2x time of arm at -Os, -O2, -O3 (memory-hog)

2017-08-11 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81818 --- Comment #1 from Andrew Roberts --- Created attachment 41974 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=41974=edit Full test report for Raspberry Pi ARM Test results for -O0, -Os, -O1, -O2, -O3 for gcc 5.4.0, 6.4.0, 7.2.0, 8.0.0 on

[Bug c++/81818] New: aarch64 uses 2-3x memory and 2x time of arm at -Os, -O2, -O3 (memory-hog)

2017-08-11 Thread andrewm.roberts at sky dot com
: normal Priority: P3 Component: c++ Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- Created attachment 41973 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=41973=edit System independent test prog

[Bug c++/81818] aarch64 uses 2-3x memory and 2x time of arm at -Os, -O2, -O3 (memory-hog)

2017-08-11 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81818 --- Comment #3 from Andrew Roberts --- I've added the test results for the arm and aarch64 builds on Raspberry Pi3. These show compilation time, memory used, and object file size for: -O0, -Os, -O1, -O2, -O3 using gcc 5.4.0, 6.4.0, 7.2.0, and

[Bug middle-end/81818] aarch64 uses 2-3x memory and 2x time of arm at -Os, -O2, -O3

2017-08-13 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81818 --- Comment #5 from Andrew Roberts --- Ok, I've done some more digging. Looking at the optimization options enabled by -O2 vs -O1, I built the test program at -O1 and enabled each optimization in turn, on both ARM and AARCH64. It looks like

[Bug middle-end/81818] aarch64 uses 2-3x memory and 2x time of arm at -Os, -O2, -O3

2017-08-17 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81818 --- Comment #10 from Andrew Roberts --- I've attached the output for gcc 7.2.0 with -fmem-report (as gcc-7.2.0-fmem-report.tar.bz2). g++ -Ox -fmem-report -c testmap.cpp where -Ox is one of: -O0, -O1, -O2, -O3, or -O1 -fgcse This is across: x64

[Bug middle-end/81818] aarch64 uses 2-3x memory and 2x time of arm at -Os, -O2, -O3

2017-08-17 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81818 --- Comment #11 from Andrew Roberts --- Created attachment 41992 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=41992=edit gcc-7.2.0 -fmem-report output for arm, aarch64, and x86-64 Output for gcc 7.2.0 with -fmem-report (as

[Bug bootstrap/81864] building gcc 8 with --enable-gather-detailed-mem-stats fails on x86-64, arm and aarch64 under gnu linux

2017-08-17 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81864 --- Comment #2 from Andrew Roberts --- I can confirm gcc 7.2.0 builds ok on x86-64, arm and aarch64 with --enable-gather-detailed-mem-stats. So its just 8.0.0 which is failing.

[Bug bootstrap/81864] building gcc 8 with --enable-gather-detailed-mem-stats fails on x86-64, arm and aarch64 under gnu linux

2017-08-17 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81864 --- Comment #7 from Andrew Roberts --- Works for me on x86-64, trying aarch64 now.

[Bug bootstrap/81864] building gcc 8 with --enable-gather-detailed-mem-stats fails on x86-64, arm and aarch64 under gnu linux

2017-08-17 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81864 --- Comment #8 from Andrew Roberts --- aarch64 also ok with gcc-8.0.0 for me.

[Bug target/82175] -march=native fails on armv7 big/little system armv7l-unknown-linux-gnueabihf with gcc 8.0.0

2017-09-11 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82175 --- Comment #1 from Andrew Roberts --- This also fails on a Raspberry PI 3 running armv7 in the same way. Looks like the armv8 code has got mixed up with the armv7 code... The RPI3 has: 4x ARM Cortex-A53 rev 4 (0x4100d030) cat /proc/cpuinfo

[Bug c/82175] New: -march=native fails on armv7 big/little system armv7l-unknown-linux-gnueabihf with gcc 8.0.0

2017-09-11 Thread andrewm.roberts at sky dot com
Severity: normal Priority: P3 Component: c Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- gcc-7.2.0 is ok on this target, but gcc-8.0.0 fails to detect native target. cat > test.c #include

[Bug target/82175] [8 Regression] -march=native fails on armv7 big/little system armv7l-unknown-linux-gnueabihf with gcc 8.0.0

2017-10-02 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82175 --- Comment #9 from Andrew Roberts --- Created attachment 42275 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42275=edit Output of gcc -march=native -Q --help=target for gcc 8.0.0.20171001 Output of gcc -march=native -Q --help=target for

[Bug target/82175] [8 Regression] -march=native fails on armv7 big/little system armv7l-unknown-linux-gnueabihf with gcc 8.0.0

2017-10-02 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82175 --- Comment #6 from Andrew Roberts --- Thanks Richard, this is now ok, tested on armv7 and aarch64. However I do see differences in what is selected by march=native on arm between 7.2.0 and 8.0.0.20171001. Is this as expected? Or is it a work

[Bug target/82175] [8 Regression] -march=native fails on armv7 big/little system armv7l-unknown-linux-gnueabihf with gcc 8.0.0

2017-10-02 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82175 --- Comment #10 from Andrew Roberts --- Created attachment 42276 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42276=edit Output of gcc -march=native -Q --help=target for gcc 7.2.0 Output of gcc -march=native -Q --help=target for gcc

[Bug target/82175] [8 Regression] -march=native fails on armv7 big/little system armv7l-unknown-linux-gnueabihf with gcc 8.0.0

2017-10-02 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82175 --- Comment #8 from Andrew Roberts --- I generated it using: /usr/local/gcc-7.2.0/bin/gcc -march=native -Q --help=target and /usr/local/gcc-8.0.0/bin/gcc -march=native -Q --help=target on each of the systems. Using:

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-27 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #14 from Andrew Roberts --- It would be nice if znver1 for -march and -mtune could be improved before the gcc 8 release. At present -march=znver1 -mtune=znver1 looks be to about the worst thing you could do, and not just on this

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-27 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #17 from Andrew Roberts --- The general consensus in userland is that the znver1 optimization is much worse than 0.5%, or even 2% off. Most people are using -march=haswell if they care about performance. Just taking one part of one

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-28 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #24 from Andrew Roberts --- For the mt19937ar test: /usr/local/gcc/bin/gcc -march=znver1 -mtune=znver1 -O3 mt19937ar.c -o mt19937ar mt19937ar took 462062 clocks /usr/local/gcc/bin/gcc -march=znver1 -mtune=znver1

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-28 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #23 from Andrew Roberts --- Thanks Honza, getting closer, with original matrix.c on Ryzen: /usr/local/gcc/bin/gcc -march=znver1 -mtune=znver1 -O3 matrix.c -o matrix mult took 364850 clocks /usr/local/gcc/bin/gcc

[Bug driver/83193] New: Help for invalid -march= options from cc1 omits -march=native on x86-64, arm. aarch64, output also inconsistent

2017-11-27 Thread andrewm.roberts at sky dot com
Status: UNCONFIRMED Severity: normal Priority: P3 Component: driver Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- -march=native no longer documented in cc1 help message

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 Andrew Roberts changed: What|Removed |Added CC||andrewm.roberts at sky dot com

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #9 from Andrew Roberts --- Created attachment 42690 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42690=edit Test results for Skylake system with matrix.c Test results for Skylake system with matrix.c

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #8 from Andrew Roberts --- Created attachment 42689 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42689=edit Test results for Haswell system with matrix.c Test results for Haswell system with matrix.c

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #7 from Andrew Roberts --- Created attachment 42688 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42688=edit Test results for Ryzen system with matrix.c Test results for Ryzen system with matrix.c

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #5 from Andrew Roberts --- I've been testing on a Ryzen system and also comparing with Haswell and Skylake. From my testing -mtune=znver1 does not perform well and never has, including as of last snapshot: gcc version 8.0.0 20171119

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #10 from Andrew Roberts --- Created attachment 42691 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42691=edit Script for matrix.c test program Script for matrix.c test program

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #6 from Andrew Roberts --- Created attachment 42687 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42687=edit Test program used for the attached performance results (matrix.c) Test program used for the attached performance

[Bug driver/83206] New: -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-11-28 Thread andrewm.roberts at sky dot com
Priority: P3 Component: driver Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- On ARM an option to -mfpu is auto, this is given when you do: /usr/local/gcc/bin/gcc -mcpu=native -Q --help=target ... Known ARM

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-28 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #29 from Andrew Roberts --- And rerunning all the tests for matrix.c on Ryzen using: -march=$amarch -mtune=$amtune -mprefer-vector-width=none -mno-fma -O3 The winners were: mult took 118145 clocks -march=broadwell -mtune=broadwell

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-11-28 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #1 from Andrew Roberts --- This was tested using: /usr/local/gcc/bin/gcc -v Using built-in specs. COLLECT_GCC=/usr/local/gcc/bin/gcc

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-28 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #28 from Andrew Roberts --- Adding -mno-avx2 into the mix was a marginal win, but only just showing out of the noise: /usr/local/gcc/bin/gcc -march=znver1 -mtune=znver1 -mprefer-vector-width=none -mno-fma -mno-avx2 -O3 matrix.c -o

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-11-29 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #2 from Andrew Roberts --- Correction: 1) This works on gcc 8 snapshot, it doesn't work on gcc-7.2.0 /usr/local/gcc-7.2.0/bin/gcc -march=native -mcpu=cortex-a53 -mfpu=auto -Ofast -o matrix matrix.c cc1: error: -mfloat-abi=hard:

[Bug driver/83207] New: On ARM -mcpu=native does not detect ARM big/little cpu combinations correctly (armv7l-unknown-linux-gnueabihf)

2017-11-28 Thread andrewm.roberts at sky dot com
Status: UNCONFIRMED Severity: normal Priority: P3 Component: driver Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- On ARM autodetection of the CPU using -mcpu=native does not give

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-29 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #31 from Andrew Roberts --- of for mt19937ar with -mno-avx2 /usr/local/gcc/bin/gcc -march=$amarch -mtune=$amtune -mno-avx2 -O3 -o mt199 37ar mt19937ar.c Top 2: mt19937ar took 358493 clocks -march=silvermont -mtune=bdver1 mt19937ar

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-11-29 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #3 from Andrew Roberts --- ok confirmed, this bug is still present on the gcc-7 branch, with the current snapshot: /usr/local/gcc-7.2.1/bin/gcc -v Using built-in specs. COLLECT_GCC=/usr/local/gcc-7.2.1/bin/gcc

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-29 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #32 from Andrew Roberts --- For what its worth, here's what the latest and greatest from the competition has to offer: /usr/local/llvm-5.0.1-rc2/bin/clang -march=znver1 -mtune=znver1 -O3 matrix.c -o matrix mult took

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-29 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #33 from Andrew Roberts --- That second llvm command line should read: /usr/local/llvm-5.0.1-rc2/bin/clang -march=znver1 -mtune=znver1 -Ofast mt19937ar.c -o mt19937ar

[Bug driver/83193] Help for invalid -march= options from cc1 omits -march=native on x86-64, arm. aarch64, output also inconsistent

2017-11-27 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83193 --- Comment #1 from Andrew Roberts --- The same comments also apply to the -mcpu and -mtune options as well. Double output on arm for -mcpu, and missing help for native. also: gcc -Q --help=target used to document the allowable -mcpu/-mtune

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-26 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #12 from Andrew Roberts --- Ok I've tried again with this weeks snapshot: gcc version 8.0.0 20171126 (experimental) (GCC) Taking combination of -march and -mtune which works well on Ryzen: /usr/local/gcc/bin/gcc -march=core-avx-i

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-27 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #19 from Andrew Roberts --- Created attachment 42735 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42735=edit modified mt19937ar test program, test script and results modified mt19937ar test program, test script and results

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-27 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #18 from Andrew Roberts --- Ok trying an entirely different algorith, same results: Using Mersenne Twister algorithm from here: http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/MT2002/emt19937ar.html alter main program to comment

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2017-11-27 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #20 from Andrew Roberts --- Again those latest mt19937ar results above were with the current snapshot: /usr/local/gcc/bin/gcc -v Using built-in specs. COLLECT_GCC=/usr/local/gcc/bin/gcc

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-12-04 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #15 from Andrew Roberts --- Created attachment 42792 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42792=edit /proc/cpuinfo fro rpi3 (cortex a-53) on aarch64 /proc/cpuinfo fro rpi3 (cortex a-53) on aarch64 while this is the

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-12-04 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #9 from Andrew Roberts --- Created attachment 42787 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42787=edit /proc/cpuinfo from cortex-a7 Raspberry Pi 2b v1.1 /proc/cpuinfo from cortex-a7 Raspberry Pi 2b v1.1

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-12-04 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #10 from Andrew Roberts --- Created attachment 42788 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42788=edit /proc/cpuinfo from odroid-xu4 big/little cortex-a15/cortex-a7 /proc/cpuinfo from odroid-xu4 big/little

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-12-04 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #12 from Andrew Roberts --- Created attachment 42790 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42790=edit /proc/cpuinfo from Raspberry Pi 3 (cortex-A53) arm mode /proc/cpuinfo from Raspberry Pi 3 (cortex-A53) arm mode

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-12-04 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #13 from Andrew Roberts --- Created attachment 42791 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42791=edit /proc/cpuinfo from odroid-c2 (cortex-A53) aarch64 mode /proc/cpuinfo from odroid-c2 (cortex-A53) aarch64 mode

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-12-04 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #11 from Andrew Roberts --- Created attachment 42789 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42789=edit /proc/cpuinfo from rpi b (arm1176jzf-s) /proc/cpuinfo from rpi b (arm1176jzf-s)

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-12-04 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #14 from Andrew Roberts --- Richard, I have checked with latest snapshot (20171203) and problem persists. I think the issue is that the CPU on the original Raspberry Pi and Pi Zero is not detected properly by gcc.

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-12-03 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #5 from Andrew Roberts --- It looks like I was right about this all along, its just that armv6l isn't working. armv7l seems ok: On RaspberryPi B - ARM1176 rev 7 (0x4100b760) cat /proc/cpuinfo processor : 0 model name :

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-12-04 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #7 from Andrew Roberts --- I get the same thing if I just use -mcpu=native: /usr/local/gcc/bin/gcc -o matrix-v6 -mcpu=native -mfpu=auto -O3 matrix.c cc1: error: -mfloat-abi=hard: selected processor lacks an FPU I realize the

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-12-10 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #18 from Andrew Roberts --- Richard, I'm giving the latest snapshot a test, the armv6 version will be ready in 16 hrs or so... Meanwhile a question about consistency with gcc -Q --help=target, and also what happens if you don't

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2017-12-11 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #20 from Andrew Roberts --- The patch in in latest snapshot is working ok on Raspberry Pi Zero. And -help=target now returns: /usr/local/gcc/bin/gcc -march=native -mcpu=native -mfpu=auto -Q --help=target | grep "march\|mcpu\|mfpu"

[Bug web/85578] broken links in gcc-8.0.1-RC-20180427/INSTALL/specific.html, and out of date prerequisites.html

2018-05-01 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85578 --- Comment #2 from Andrew Roberts --- Ok thanks, just checking on the prerequisites front.

[Bug web/85578] New: broken links in gcc-8.0.1-RC-20180427/INSTALL/specific.html, and out of date prerequisites.html

2018-04-30 Thread andrewm.roberts at sky dot com
Status: UNCONFIRMED Severity: normal Priority: P3 Component: web Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- The file INSTALL/specific.html in gcc-8.0.1-RC-20180427 contains many broken links

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2018-01-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #46 from Andrew Roberts --- With the latest snapshot: gcc version 8.0.1 20180121 For the mt19937ar things now look reasonable without any strange options on Ryzen. Top 5 mt19937ar took 226849 clocks -march=amdfam10 -mtune=btver2

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2018-01-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #47 from Andrew Roberts --- Again with the latest snapshot: gcc version 8.0.1 20180121 matrix.c is still needing additional options to get the best out of the Ryzen processor. But is better than before (223029 clocks vs 371978

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2018-01-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #48 from Andrew Roberts --- Correction, that should be 23 not 23000 for the haswell drop in performance.

[Bug driver/83193] Help for invalid -march= options from cc1 omits -march=native on x86-64, arm. aarch64, output also inconsistent

2018-02-19 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83193 --- Comment #4 from Andrew Roberts --- Correct, it does not show native in the list of valid options presented to the user.

[Bug bootstrap/83903] New: gcc 8.0.0 20180114 fails to bootstrap on Darwin x86_64, undeclared ASM_OUTPUT_DEF

2018-01-16 Thread andrewm.roberts at sky dot com
: normal Priority: P3 Component: bootstrap Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- Building gcc 8.0.0 20180114 on Darwin (OS X) with llvm fails with an undeclared identifier: .../../gcc-8.0.0

[Bug target/81616] Update -mtune=generic for the current Intel and AMD processors

2018-01-22 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81616 --- Comment #50 from Andrew Roberts --- with the matrix.c benchmark on Ryzen and looking at the other options when using -march=znver1 and -mtune=znver1 mult took 225281 clocks -march=znver1 -mtune=znver1 -mprefer-vector-width=128 mult took

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2018-03-07 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #23 from Andrew Roberts --- RPI Zero still looks ok with latest snapshot. /usr/local/gcc/bin/gcc -mfpu=auto -O3 -o matrix matrix.c cc1: error: -mfloat-abi=hard: selected processor lacks an FPU /usr/local/gcc/bin/gcc -mcpu=native

[Bug bootstrap/84800] New: ICE building gcc in isl_factorization.c with xgcc on SPARC Solaris with 8-20180304 snapshot

2018-03-09 Thread andrewm.roberts at sky dot com
Severity: normal Priority: P3 Component: bootstrap Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- Building gcc-8-20180304 failed on native SPARC Solaris 10 (sparc-sun-solaris2.10). I had previously

[Bug bootstrap/84800] ICE building gcc in isl_factorization.c with xgcc on SPARC Solaris with 8-20180304 snapshot

2018-03-14 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84800 --- Comment #2 from Andrew Roberts --- Rebuilt with 8-20180311 snapshot, and it now builds successfully: /usr/local/gcc/bin/gcc -v Using built-in specs. COLLECT_GCC=/usr/local/gcc/bin/gcc

[Bug driver/83206] -mfpu=auto does not work on ARM (armv7l-unknown-linux-gnueabihf)

2018-03-07 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83206 --- Comment #22 from Andrew Roberts --- The RPI Zero bug was fixed, I'm retesting with the latest snapshot (8.0.1 20180304) just to be sure it is ok. There are still a number of inconsistencies and things which could be improved. On Odroid-Xu4

[Bug target/89508] New: gcc snapshot 9.0.1 20190127 generates invalid assembler options on aarch64-unknown-linux-gnu with -march=native

2019-02-26 Thread andrewm.roberts at sky dot com
Status: UNCONFIRMED Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- The latest gcc 9 snapshot (gcc version 9.0.1 20190127

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-07-01 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #9 from Andrew Roberts --- For completeness I've also built gcc 8.3.0 with in tree gmp 6.1.2 using the newly built 9.1.0. And then in turn used this gcc 8.3.0 to rebuild gcc 9.1.0 with in tree gmp. So the host gcc 8.3.0 doesn't work

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-06-30 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #5 from Andrew Roberts --- OK I tried again using the latest gmp snapshot: gmp-6.1.99-20190630.tar.lz I still get the same error.

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-07-01 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #8 from Andrew Roberts --- Build of 9.1.0 using system gmp worked fine. Rebuild of 9.1.0 with in tree gmp-6.1.2 using that version of gcc also worked fine. Thus probably a host gcc compiler problem, I'll report to Raspbian.

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-06-29 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #2 from Andrew Roberts --- Building 9.1.0 with gmp 6.1.2 using "-v -save-temps" gives: gcc -v -save-temps -c -DHAVE_CONFIG_H -I. -I../../../gcc-9.1.0/gmp/mpn -I.. -D__GMP_WITHIN_GMP -I../../../gcc-9.1.0/gmp -DOPERATION_divrem_1

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-06-29 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #3 from Andrew Roberts --- Created attachment 46535 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=46535=edit tmp-divrem_1.s file generated using m4 from divrem_1.asm in gmp/mpn This is the gmp 6.1.2 version, from the gcc

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-07-01 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #11 from Andrew Roberts --- Richard, the cpu supports mls (its a ARM Cortex A72). Comment 2 shows the -v output for both building gmp within gcc and standalone. When building gmp in tree using Raspbian compiler: as --gdwarf2 -v -I

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-07-01 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #12 from Andrew Roberts --- GMP 6.1.0 and later support the following configure option: --enable-assembly enable the use of assembly loops [default=yes] not sure if this could be used to stop gmp using assembler.

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-07-01 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #13 from Andrew Roberts --- Just tried --enable-assembly=no with the standalone build of gmp and this does seem to work as advertised. Everything symlinked to .c rather than .asm files, and no .asm or .s files built at all. Building

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-07-01 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #14 from Andrew Roberts --- One final point even vanilla gcc 9.1.0 fails to build gmp standalone if CFLAGS is set, so issue with Raspbian compiler is that it is probably setting CFLAGS and thus messing up gmp build. To cause

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-07-01 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #6 from Andrew Roberts --- I'm now building gcc 9.1.0 with the system gmp (using --with-gmp-lib and --with-gmp-include). If this is successful I'll use this compiler to rebuild itself with an in tree gmp and see where that gets me.

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-07-02 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #16 from Andrew Roberts --- That kicks a memory loose, from my build script: # sed needed for GMP >=5.1 && < 6.2.0 on ARM otherwise isl build fails with # undefined symbol __gmpn_invert_limb sed -ixx "s/none-/${uname_m}-/" Makefile

[Bug bootstrap/91034] New: In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-06-29 Thread andrewm.roberts at sky dot com
Status: UNCONFIRMED Severity: normal Priority: P3 Component: bootstrap Assignee: unassigned at gcc dot gnu.org Reporter: andrewm.roberts at sky dot com Target Milestone: --- When building gcc 9.1.0 (or 8.3.0) on Raspberry Pi 4 (ARM

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-06-29 Thread andrewm.roberts at sky dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91034 --- Comment #1 from Andrew Roberts --- Configure line used for building gcc 9.1.0: ../gcc-9.1.0/configure --prefix=/usr/local/gcc-9.1.0--program-suffix= --disa ble-werror --enable-shared --enable-threads=posix --enable-checking=release

  1   2   >