https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89853
--- Comment #2 from Martin Jambor <jamborm at gcc dot gnu.org> ---
Doh, yes, copy-paste error, sorry. The data should have been:
FAST:
Performance counter stats for 'numactl -C 0 -l specinvoke':
495413.105450 task-clock:u (msec) # 0.999 CPUs utilized
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
80572 page-faults:u # 0.163 K/sec
1573525941814 cycles:u # 3.176 GHz
(83.33%)
56730573392 stalled-cycles-frontend:u # 3.61% frontend cycles
idle (83.33%)
397644125819 stalled-cycles-backend:u # 25.27% backend cycles
idle (83.33%)
5157395976259 instructions:u # 3.28 insn per cycle
# 0.08 stalled cycles per
insn (83.33%)
421019689027 branches:u # 849.836 M/sec
(83.33%)
10705813341 branch-misses:u # 2.54% of all branches
(83.33%)
495.869208013 seconds time elapsed
# Event count (approx.): 1576108148398
#
# Overhead Samples Command Shared Object Symbol
# ........ ............ ............... ............................
.................................................
#
14.20% 282290 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_satd_8x4
11.19% 222403 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
get_ref
10.82% 215061 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_sad_x4_16x16
7.00% 139082 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_sad_16x16
6.11% 121470 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_sad_x3_16x16
5.89% 116939 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_sad_x4_8x8
5.09% 101266 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
quant_4x4
4.10% 81471 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
mc_chroma
2.47% 49122 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_sad_x3_8x8
2.21% 43928 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
sub4x4_dct
2.14% 42598 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
pixel_hadamard_ac
SLOW:
Performance counter stats for 'numactl -C 0 -l specinvoke':
526858.531112 task-clock:u (msec) # 0.999 CPUs utilized
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
81064 page-faults:u # 0.154 K/sec
1673634535742 cycles:u # 3.177 GHz
(83.33%)
64458929239 stalled-cycles-frontend:u # 3.85% frontend cycles
idle (83.33%)
397586117982 stalled-cycles-backend:u # 23.76% backend cycles
idle (83.33%)
5157346862311 instructions:u # 3.08 insn per cycle
# 0.08 stalled cycles per
insn (83.33%)
421082988475 branches:u # 799.234 M/sec
(83.33%)
14226205709 branch-misses:u # 3.38% of all branches
(83.33%)
527.353829377 seconds time elapsed
# Samples: 2M of event 'cycles'
# Event count (approx.): 1675655436335
#
# Overhead Samples Command Shared Object
Symbol
# ........ ............ ............... ............................
.................................................
#
14.13% 298519 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_sad_x4_16x16
13.43% 283793 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_satd_8x4
11.56% 244196 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
get_ref
7.17% 151589 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_sad_x3_16x16
6.29% 132936 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_sad_16x16
5.28% 111517 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_sad_x4_8x8
4.84% 102317 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
quant_4x4
3.86% 81563 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
mc_chroma
2.57% 54233 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
x264_pixel_sad_x3_8x8
2.08% 43964 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
sub4x4_dct
2.01% 42520 x264_r_base.min x264_r_base.mine-gen-std-m64 [.]
pixel_hadamard_ac