Kenneth, would we want this patch in 17.1 or we shouldn't bother ?

On Tue, 2017-05-30 at 16:28 -0700, Kenneth Graunke wrote:
> As of 4.11, the kernel isn't bothering to set the subslice hashing mode
> on Apollolake, leaving it at the default of 8x8.  (It initializes it to
> 16x4 on most platforms.)
> 
> Performance data for GPUTest Triangle on Apollolake at 1024x640:
> 
>    X-tiled RT:
>    -----------
>    8x8 -> 16x4:   2.4325%  +/- 0.383683% (n=107)
>    8x8 -> 8x4:   -3.75105% +/- 0.592491% (n=40)
>    8x8 -> 16x16:  6.17238% +/- 0.67157%  (n=30)
> 
>    Y-tiled RT:
>    -----------
>    8x8 -> 16x4:   1.30307%  +/- 0.297292% (n=205)
>    8x8 -> 8x4:   -0.769282% +/- 0.729557% (n=35)
>    8x8 -> 16x16:  3.00254%  +/- 0.715503% (n=40)
> 
>    8x MSAA RT (INTEL_FORCE_MSAA=8):
>    --------------------------------
>    8x8 -> 16x4:   1.38889% +/- 0.93729%  (n=7)
>    8x8 -> 8x4:   -2.10643% +/- 1.15153%  (n=3)
>    8x8 -> 16x16:  3.87183% +/- 1.08851%  (n=5)
> 
> Based on this, we choose 16x16 for Apollolake.
> 
> Skylake GT2 with X-tiled buffers appears to be a toss-up between 16x4
> and 16x16, and with Y-tiled buffers it doesn't seem to really matter.
> So we'll leave Skylake alone for now.
> 
> The hashing mode doesn't seem to make a measurable impact on more
> complex benchmarks.
> ---
>  src/mesa/drivers/dri/i965/brw_defines.h      | 7 +++++++
>  src/mesa/drivers/dri/i965/brw_state_upload.c | 9 +++++++++
>  2 files changed, 16 insertions(+)
> 
> diff --git a/src/mesa/drivers/dri/i965/brw_defines.h 
> b/src/mesa/drivers/dri/i965/brw_defines.h
> index 312dddafd77..1278634269a 100644
> --- a/src/mesa/drivers/dri/i965/brw_defines.h
> +++ b/src/mesa/drivers/dri/i965/brw_defines.h
> @@ -1616,6 +1616,13 @@ enum brw_pixel_shader_coverage_mask_mode {
>  # define GEN8_HIZ_PMA_MASK_BITS \
>     REG_MASK(GEN8_HIZ_NP_PMA_FIX_ENABLE | GEN8_HIZ_NP_EARLY_Z_FAILS_DISABLE)
>  
> +#define GEN7_GT_MODE                    0x7008
> +# define GEN9_SUBSLICE_HASHING_8x8      (0 << 8)
> +# define GEN9_SUBSLICE_HASHING_16x4     (1 << 8)
> +# define GEN9_SUBSLICE_HASHING_8x4      (2 << 8)
> +# define GEN9_SUBSLICE_HASHING_16x16    (3 << 8)
> +# define GEN9_SUBSLICE_HASHING_MASK_BITS REG_MASK(3 << 8)
> +
>  /* Predicate registers */
>  #define MI_PREDICATE_SRC0               0x2400
>  #define MI_PREDICATE_SRC1               0x2408
> diff --git a/src/mesa/drivers/dri/i965/brw_state_upload.c 
> b/src/mesa/drivers/dri/i965/brw_state_upload.c
> index 4647f1c41e0..6a8547c4ede 100644
> --- a/src/mesa/drivers/dri/i965/brw_state_upload.c
> +++ b/src/mesa/drivers/dri/i965/brw_state_upload.c
> @@ -70,6 +70,15 @@ brw_upload_initial_gpu_state(struct brw_context *brw)
>                  GEN9_FLOAT_BLEND_OPTIMIZATION_ENABLE |
>                  GEN9_PARTIAL_RESOLVE_DISABLE_IN_VC);
>        ADVANCE_BATCH();
> +
> +      if (brw->is_broxton) {
> +         BEGIN_BATCH(3);
> +         OUT_BATCH(MI_LOAD_REGISTER_IMM | (3 - 2));
> +         OUT_BATCH(GEN7_GT_MODE);
> +         OUT_BATCH(GEN9_SUBSLICE_HASHING_MASK_BITS |
> +                   GEN9_SUBSLICE_HASHING_16x16);
> +         ADVANCE_BATCH();
> +      }
>     }
>  
>     if (brw->gen >= 8) {
-- 
Br,

Andres
_______________________________________________
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/mesa-dev

Reply via email to