On Mon, Mar 12, 2018 at 08:02:39PM -0500, Aaron Sawdey wrote:
> Looking at CPU2017 results for different reassociation widths, things
> have shifted since I last looked at this with CPU2006 in early gcc7
> timeframe. Best thing to do seems to be to set reassociation width to 1
> for all integer modes, which is what the attached patch does.
> 
> I also tried setting width to 1 for float modes PLUS_EXPR as this patch
> did for aarch64 but this does not seem to be helpful for power8.
> https://gcc.gnu.org/ml/gcc-patches/2018-02/msg01271.html
> 
> 
> Results below are % performance improvement on power8 comparing trunk
> with the attached patch vs trunk with --param tree-reassoc-width=1 to
> disable parallel reassociation for everything (first column of results)
> and trunk unmodified (second column of results). 
> 
> CPU2017 component    vs width=1   vs trunk
> 500.perlbench_r        -0.36%     -0.15%
> 502.gcc_r               0.06%      0.04%
> 505.mcf_r               0.32%      0.24%
> 520.omnetpp_r           0.57%     -0.95%
> 523.xalancbmk_r         1.45%      1.04%
> 525.x264_r             -0.05%      0.09%
> 531.deepsjeng_r         0.04%      0.09%
> 541.leela_r             0.10%      0.72%
> 548.exchange2_r         0.08%      0.73%
> 557.xz_r                0.09%      2.12%
> CPU2017 int geo mean    0.23%      0.40%
> 503.bwaves_r            0.00%      0.01%
> 507.cactuBSSN_r         0.05%     -0.02%
> 508.namd_r              0.00%      0.00%
> 510.parest_r           -0.01%      0.20%
> 511.povray_r            0.03%     -0.24%
> 519.lbm_r              -0.04%     -0.16%
> 521.wrf_r              -0.01%     -0.56%
> 526.blender_r          -0.82%     -0.47%
> 527.cam4_r             -0.18%      0.06%
> 538.imagick_r          -0.02%      0.01%
> 544.nab_r               0.00%      0.23%
> 549.fotonik3d_r         0.24%      0.54%
> 554.roms_r             -0.05%      0.03%
> CPU2017 fp geo mean    -0.06%     -0.03%
> 
> Bottom line is net improvement for CPU2017 int compared with either
> current trunk, or disabling parallel reassociation. For CPU2017 fp,
> very small overall degradation. 
> 
> Currently doing regstrap on ppc64le, ok for trunk if results look good?

Can't argue with a 0.4% win :-)  Okay for trunk.  Thanks!


Segher


> 2018-03-12  Aaron Sawdey  <acsaw...@linux.vnet.ibm.com>
> 
>       PR target/84743
>       * config/rs6000/rs6000.c (rs6000_reassociation_width): Disable parallel
>       reassociation for int modes.

Reply via email to