https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
--- Comment #12 from PeteVine ---
Nice, PR68664 patch has fixed the issue.
FWIW, unlike previously, running on a Cortex-A53 showed perfect alignment with
core type (-mfpu=vfpv3) on the first run:
Cortex-A8
Rendering took: 1 seconds (1801
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
--- Comment #11 from PeteVine ---
Super cool, thanks! That makes the OP a true prophet before his time ;)
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
--- Comment #10 from James Greenhalgh ---
(In reply to PeteVine from comment #9)
> @jgreenhalgh Please have a look at the profiled assembly for both fast and
> slow codegen. (attached)
>
> According to @aldyh's bisection in #68664 this probably
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
--- Comment #9 from PeteVine ---
@jgreenhalgh Please have a look at the profiled assembly for both fast and slow
codegen. (attached)
According to @aldyh's bisection in #68664 this probably isn't the same issue.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
--- Comment #8 from Siarhei Siamashka ---
Since my report predates bug 68664 by several years, shouldn't bug 68664 be a
duplicate? In addition, my report was much more detailed, since it also
provided a practical use case, showcasing the
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
James Greenhalgh changed:
What|Removed |Added
Status|WAITING |RESOLVED
Resolution|---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
--- Comment #6 from PeteVine ---
Testing different 32-bit codegen options in aarch32 mode on a Cortex-A53 shows
that A15 is probably also affected. Full comparison below:
$ for i in 8 5 7 9 15 ; do gcc -marm -Ofast -o c-ray-a$i c-ray-mt.c -lm
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
--- Comment #5 from PeteVine ---
Created attachment 39649
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=39649&action=edit
Annotated ARMv7 assembly
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
--- Comment #4 from PeteVine ---
I've just done the obvious and run the resulting ARMv7 binaries on a Cortex A53
in aarch32 mode and the difference is there (GCC 6.2.1 and 7.0.0) so I can
confirm the issue is present to this day.
Cortex-A5 vs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
--- Comment #3 from PeteVine ---
Curiously, up to gcc 6, targeting Cortex-A5 made virtually no difference, but
in gcc 7, generic codegen takes an 8% hit while -mcpu=cortex-a5 produces
roughly the same performance as before. (but that's a
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
PeteVine changed:
What|Removed |Added
CC||tulipawn at gmail dot com
--- Comment #2
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
Ramana Radhakrishnan <ramana at gcc dot gnu.org> changed:
What|Removed |Added
Status|UNCONFIRMED |WAITING
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
Richard Earnshaw <rearnsha at gcc dot gnu.org> changed:
What|Removed |Added
Target||arm