Bug ID: 81828
Summary: Cilkplus performance regression on ARM...
Assignee: unassigned at gcc dot gnu.org
Reporter: ejolson at unr dot edu
Target Milestone: ---
Created attachment 41979
Graph showing performance regression...
Code compiled with gcc version 7.1 using the Cilkplus parallel programming
extensions on ARM runs much slower than the same code compiled with version
6.2. Details may be
viewed graphically as
which consistently shows a loss of performance using any combination of 1 to 8
cores on a Samsung/Nexell S5P6818 based SBC. More information and example code
is available at
My impression is that this regression affects almost all Cilkplus code on ARM
and is possibly the result of an unaligned cactus stack or additional overhead
in switching tasks that was not present in the 6.2 version. It is likely that
performance-based tests for ARM Cilkplus are needed to ensure such regressions
do not happen in the future. Note that the performance of serial code is not
The test code was compiled for 32-bit mode using options
-fcilkplus -O3 -mcpu=cortex-a7 -mfpu=neon-vfpv4 -mfloat-abi=hard
and run under identical circumstances in both cases.
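
As a rough illustration of the kind of performance-based test suggested above, here is a minimal sketch of a spawn-heavy timing test. It is not the attached test code: the recursive Fibonacci workload, the function names fib and time_fib, and the FIB_BENCH_MAIN guard are all assumptions chosen for illustration. It relies on the __cilk macro that the Cilk Plus specification defines when the extensions are enabled, and falls back to serial elision otherwise, so the same source can also serve as a serial baseline.

```c
/* Hypothetical micro-benchmark sketch; not the code attached to this report. */
#ifndef _POSIX_C_SOURCE
#define _POSIX_C_SOURCE 199309L   /* for clock_gettime */
#endif
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <time.h>

#ifdef __cilk
#include <cilk/cilk.h>            /* provides cilk_spawn and cilk_sync */
#else
/* Serial elision: without -fcilkplus the keywords expand to nothing. */
#define cilk_spawn
#define cilk_sync
#endif

/* Naive recursive Fibonacci: many tiny tasks, so the run time is dominated
   by spawn and task-switching overhead rather than by arithmetic. */
static long fib(int n)
{
    if (n < 2)
        return n;
    long a = cilk_spawn fib(n - 1);
    long b = fib(n - 2);
    cilk_sync;
    return a + b;
}

/* Run fib(n) once, store the value in *result, and return the elapsed
   wall-clock time in seconds. */
static double time_fib(int n, long *result)
{
    struct timespec t0, t1;
    assert(result != NULL);
    clock_gettime(CLOCK_MONOTONIC, &t0);
    *result = fib(n);
    clock_gettime(CLOCK_MONOTONIC, &t1);
    return (double)(t1.tv_sec - t0.tv_sec)
         + (double)(t1.tv_nsec - t0.tv_nsec) * 1e-9;
}

#ifdef FIB_BENCH_MAIN
int main(int argc, char **argv)
{
    int n = (argc > 1) ? atoi(argv[1]) : 30;
    long value;
    double seconds = time_fib(n, &value);
    printf("fib(%d) = %ld in %.3f s\n", n, value, seconds);
    return 0;
}
#endif
```

Built with each compiler version using the options listed above plus -DFIB_BENCH_MAIN, and run with the worker count set through the Cilk runtime's CILK_NWORKERS environment variable (for example CILK_NWORKERS=4 ./fib 30), the reported times could be compared between 6.2 and 7.1. Building once without -fcilkplus gives the serial time for reference.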