Here is a comparison of the 4.1 branch and the 4.2 branch. In brief, 4.2
has 0.47% better performance on SPECint2000 and 2.2% better
performance on SPECfp2000.
As I remember, this increase in SPECfp performance comes mostly from
the implementation of Itanium speculation support for scheduling by ISP
RAS (many thanks for implementing that). Generally speaking,
speculative insns are inserted when we cannot say anything about
aliasing (so speculation compensates for overly conservative alias
analysis).
The code size (text segment) is 0.3% and 1.2% bigger for 4.2 on
SPECint2000 and SPECfp2000, respectively. This is not important for
Itanium because it is the processor least sensitive to code locality
(btw, the Intel compiler generates code up to 2 times bigger).
The 4.2 compiler is 19% slower when compiling SPECint2000 (both
versions of the compiler were built with --enable-checking=release). I
don't know why. I have no time for the investigation.
Itanium 1.6 GHz. The option is -O2.
base: 4.1
peak: 4.2
                                 Estimated                     Estimated
                  Base      Base      Base      Peak      Peak      Peak
Benchmarks    Ref Time  Run Time     Ratio  Ref Time  Run Time     Ratio
------------  --------  --------  --------  --------  --------  --------
164.gzip          1400       207      676*      1400       194      720*
175.vpr           1400       156      900*      1400       156      898*
176.gcc           1100      91.9     1197*      1100      92.3     1191*
181.mcf           1800       170     1057*      1800       165     1093*
186.crafty        1000       106      947*      1000       108      927*
197.parser        1800       245      734*      1800       245      734*
252.eon           1300       166      781*      1300       173      751*
253.perlbmk       1800       199      903*      1800       207      872*
254.gap           1100       214      513*      1100       205      537*
255.vortex        1900       185     1027*      1900       189     1006*
256.bzip2         1500       200      749*      1500       192      783*
300.twolf         3000       272     1102*      3000       273     1098*
Est. SPECint_base2000                  860
Est. SPECint2000                                                     864
                                 Estimated                     Estimated
                  Base      Base      Base      Peak      Peak      Peak
Benchmarks    Ref Time  Run Time     Ratio  Ref Time  Run Time     Ratio
------------  --------  --------  --------  --------  --------  --------
168.wupwise       1600       358      448*      1600       347      462*
171.swim          3100       577      538*      3100       572      542*
172.mgrid         1800       595      303*      1800       659      273*
173.applu         2100       534      393*      2100       538      390*
177.mesa          1400       173      811*      1400       173      809*
178.galgel        2900       456      636*      2900       340      853*
179.art           2600       128     2025*      2600       122     2125*
183.equake        1300       387      336*      1300       384      339*
187.facerec       1900       427      445*      1900       432      439*
188.ammp          2200       305      722*      2200       272      808*
189.lucas         2000       252      795*      2000       252      794*
191.fma3d         2100       788      266*      2100       799      263*
200.sixtrack      1100       313      352*      1100       329      334*
301.apsi          2600       486      535*      2600       486      535*
Est. SPECfp_base2000                   527
Est. SPECfp2000                                                      539
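
For anyone who wants to re-check the composite scores: SPEC CPU2000 composites are geometric means of the per-benchmark ratios. The following is a minimal Python sketch that recomputes the SPECint numbers from the ratios in the first table. The name `geomean` and the list names are illustrative only, and because the table ratios are already rounded, the last digit may differ slightly from what the SPEC tools report.

```python
import math

def geomean(values):
    """Geometric mean: exp of the average of the natural logs."""
    return math.exp(sum(math.log(v) for v in values) / len(values))

# Ratios copied from the SPECint2000 table, in order gzip ... twolf.
base_41 = [676, 900, 1197, 1057, 947, 734, 781, 903, 513, 1027, 749, 1102]
peak_42 = [720, 898, 1191, 1093, 927, 734, 751, 872, 537, 1006, 783, 1098]

base_score = geomean(base_41)
peak_score = geomean(peak_42)

# Relative improvement of 4.2 over 4.1, in percent.
gain_pct = (peak_score / base_score - 1.0) * 100.0

print(f"4.1: {base_score:.0f}  4.2: {peak_score:.0f}  gain: {gain_pct:.2f}%")
```

The same function can be applied to the SPECfp ratios in the second table.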
Code size
----------------CINT2000-----------------
 change      4.1      4.2  benchmark
-3.813%    91125    87650  164.gzip
-0.864%   295303   292751  175.vpr
 2.671%  3205470  3291094  176.gcc
 0.509%    25148    25276  181.mcf
-0.377%   407562   406026  186.crafty
-0.529%   238765   237501  197.parser
 3.022%   929950   958057  252.eon
 1.614%  1391692  1414148  253.perlbmk
-0.599%  1150630  1143742  254.gap
 1.610%  1163556  1182292  255.vortex
 0.227%    69658    69816  256.bzip2
 0.672%   494901   498229  300.twolf
Average = 0.27621%
----------------CFP2000-----------------
 change      4.1      4.2  benchmark
 6.248%    51217    54417  168.wupwise
 1.425%    20770    21066  171.swim
 6.089%    24437    25925  172.mgrid
-3.501%   108097   104313  173.applu
 0.087%  1254482  1255570  177.mesa
-0.684%   364988   362491  178.galgel
-0.793%    35299    35019  179.art
 1.827%    43781    44581  183.equake
 0.359%   119370   119798  187.facerec
 0.202%   376484   377244  188.ammp
-0.612%    86159    85632  189.lucas
 2.302%  2323132  2376611  191.fma3d
 0.943%  2052547  2071911  200.sixtrack
 5.787%   270406   286054  301.apsi
Average = 1.15761%
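
The per-benchmark percentages can be re-derived from the two size columns as (4.2 size - 4.1 size) / 4.1 size. Below is a small Python sketch of that arithmetic, plus a plain arithmetic mean over the benchmarks. The helper names are illustrative, and the plain mean is only one possible definition: how the Average lines were computed is not stated here, and by my arithmetic the plain mean comes out somewhat higher (roughly 0.35% for CINT2000 and 1.4% for CFP2000).

```python
def pct_change(old, new):
    """Relative size change from old to new, in percent."""
    return (new - old) / old * 100.0

def mean_pct_change(pairs):
    """Plain arithmetic mean of the per-benchmark changes (an assumption)."""
    return sum(pct_change(old, new) for old, new in pairs) / len(pairs)

# (4.1 size, 4.2 size) pairs copied from the listings, in table order.
cint = [
    (91125, 87650), (295303, 292751), (3205470, 3291094), (25148, 25276),
    (407562, 406026), (238765, 237501), (929950, 958057),
    (1391692, 1414148), (1150630, 1143742), (1163556, 1182292),
    (69658, 69816), (494901, 498229),
]
cfp = [
    (51217, 54417), (20770, 21066), (24437, 25925), (108097, 104313),
    (1254482, 1255570), (364988, 362491), (35299, 35019), (43781, 44581),
    (119370, 119798), (376484, 377244), (86159, 85632),
    (2323132, 2376611), (2052547, 2071911), (270406, 286054),
]

cint_mean = mean_pct_change(cint)
cfp_mean = mean_pct_change(cfp)

print(f"CINT2000 mean change: {cint_mean:.3f}%")
print(f"CFP2000  mean change: {cfp_mean:.3f}%")
```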
Compilation time of SPECint2000
4.1:
real    7m41.105s
user    6m31.680s
sys     0m13.587s
4.2:
real    8m46.611s
user    7m45.092s
sys     0m13.562s
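
The 19% figure corresponds to the user times; the difference in real (wall-clock) time is smaller, about 14%. A short Python sketch of the conversion and the ratio follows. The helper names `to_seconds` and `slowdown_pct` are illustrative, and the parser only handles the `NmS.SSSs` form shown above.

```python
import re

def to_seconds(stamp):
    """Convert a time stamp such as '7m41.105s' to seconds."""
    match = re.fullmatch(r"(\d+)m(\d+(?:\.\d+)?)s", stamp)
    if match is None:
        raise ValueError(f"unrecognized time stamp: {stamp!r}")
    minutes, seconds = match.groups()
    return int(minutes) * 60 + float(seconds)

def slowdown_pct(old, new):
    """How much longer new takes than old, in percent."""
    return (to_seconds(new) / to_seconds(old) - 1.0) * 100.0

# Times copied from the listing: 4.1 first, 4.2 second.
user_slowdown = slowdown_pct("6m31.680s", "7m45.092s")
real_slowdown = slowdown_pct("7m41.105s", "8m46.611s")

print(f"user: {user_slowdown:.1f}%  real: {real_slowdown:.1f}%")
```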