On Jul 28, 2015 2:43 AM, "Francisco Jerez" <[email protected]> wrote:
>
> The register coalesce pass wasn't rewriting the destination and
> sources of instructions that accessed the second half of a coalesced
> register previously copied with a 16-wide MOV instruction.  E.g.:
>
> | ADD (16) vgrf0:f, vgrf0:f, 1.0:f
> | MOV (16) vgrf1:f, vgrf0:f
> | MOV (8)  vgrf2:f, vgrf0+1:f { sechalf }
>
> would get incorrectly register-coalesced into:
>
> | ADD (16) vgrf1:f, vgrf1:f, 1.0:f
> | MOV (8)  vgrf2:f, vgrf0+1:f { sechalf }
>
> The reason is that the mov[i] pointer was being left equal to NULL for
> every other register.  The fact that we've made it to the rewrite loop
> implies that the whole register can been coalesced, so it doesn't seem
> useful to check that mov[i] is non-NULL every time.  Fixes an amount
> of texturing and image_load_store piglit tests on my SIMD-lowering
> branch.

Almost more to the point we don't want to coalesce a register and then not
update something that uses it. (which is what was happening.)

Reviewed-by: Jason Ekstrand <[email protected]>

> ---
>  .../drivers/dri/i965/brw_fs_register_coalesce.cpp  | 27
++++++++++------------
>  1 file changed, 12 insertions(+), 15 deletions(-)
>
> diff --git a/src/mesa/drivers/dri/i965/brw_fs_register_coalesce.cpp
b/src/mesa/drivers/dri/i965/brw_fs_register_coalesce.cpp
> index 4544122..c078318 100644
> --- a/src/mesa/drivers/dri/i965/brw_fs_register_coalesce.cpp
> +++ b/src/mesa/drivers/dri/i965/brw_fs_register_coalesce.cpp
> @@ -228,7 +228,6 @@ fs_visitor::register_coalesce()
>           continue;
>
>        progress = true;
> -      bool was_load_payload = inst->opcode == SHADER_OPCODE_LOAD_PAYLOAD;
>
>        for (int i = 0; i < src_size; i++) {
>           if (mov[i]) {
> @@ -243,20 +242,18 @@ fs_visitor::register_coalesce()
>
>        foreach_block_and_inst(block, fs_inst, scan_inst, cfg) {
>           for (int i = 0; i < src_size; i++) {
> -            if (mov[i] || was_load_payload) {
> -               if (scan_inst->dst.file == GRF &&
> -                   scan_inst->dst.reg == reg_from &&
> -                   scan_inst->dst.reg_offset == i) {
> -                  scan_inst->dst.reg = reg_to;
> -                  scan_inst->dst.reg_offset = reg_to_offset[i];
> -               }
> -               for (int j = 0; j < scan_inst->sources; j++) {
> -                  if (scan_inst->src[j].file == GRF &&
> -                      scan_inst->src[j].reg == reg_from &&
> -                      scan_inst->src[j].reg_offset == i) {
> -                     scan_inst->src[j].reg = reg_to;
> -                     scan_inst->src[j].reg_offset = reg_to_offset[i];
> -                  }
> +            if (scan_inst->dst.file == GRF &&
> +                scan_inst->dst.reg == reg_from &&
> +                scan_inst->dst.reg_offset == i) {
> +               scan_inst->dst.reg = reg_to;
> +               scan_inst->dst.reg_offset = reg_to_offset[i];
> +            }
> +            for (int j = 0; j < scan_inst->sources; j++) {
> +               if (scan_inst->src[j].file == GRF &&
> +                   scan_inst->src[j].reg == reg_from &&
> +                   scan_inst->src[j].reg_offset == i) {
> +                  scan_inst->src[j].reg = reg_to;
> +                  scan_inst->src[j].reg_offset = reg_to_offset[i];
>                 }
>              }
>           }
> --
> 2.4.6
>
_______________________________________________
mesa-dev mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/mesa-dev

Reply via email to