On Mon, Jul 17, 2023 at 07:30:57PM +0200, Andi Shyti wrote:
> Enable the CCS_FLUSH bit 13 in the control pipe for render and
> compute engines in platforms starting from Meteor Lake (BSPEC
> 43904 and 47112).
> 
> Fixes: 972282c4cf24 ("drm/i915/gen12: Add aux table invalidate for all 
> engines")
> Signed-off-by: Andi Shyti <andi.sh...@linux.intel.com>
> Cc: Jonathan Cavitt <jonathan.cav...@intel.com>
> Cc: Nirmoy Das <nirmoy....@intel.com>
> Cc: <sta...@vger.kernel.org> # v5.8+
> ---
>  drivers/gpu/drm/i915/gt/gen8_engine_cs.c     | 10 +++++++++-
>  drivers/gpu/drm/i915/gt/intel_engine_types.h |  1 +
>  drivers/gpu/drm/i915/gt/intel_gpu_commands.h |  1 +
>  3 files changed, 11 insertions(+), 1 deletion(-)
> 
> diff --git a/drivers/gpu/drm/i915/gt/gen8_engine_cs.c 
> b/drivers/gpu/drm/i915/gt/gen8_engine_cs.c
> index 3c935d6b68bf0..aa2fb9d72745a 100644
> --- a/drivers/gpu/drm/i915/gt/gen8_engine_cs.c
> +++ b/drivers/gpu/drm/i915/gt/gen8_engine_cs.c
> @@ -207,7 +207,7 @@ int gen12_emit_flush_rcs(struct i915_request *rq, u32 
> mode)
>        * memory traffic is quiesced prior.
>        */
>       if ((mode & EMIT_INVALIDATE) && !HAS_FLAT_CCS(engine->i915))
> -             mode |= EMIT_FLUSH;
> +             mode |= EMIT_FLUSH | EMIT_CCS_FLUSH;

Do we even really need the extra EMIT_* flag?  It seems like just doing
the CCS flush on graphics 12.70 and beyond would probably be fine since
EMIT_FLUSH is only used in two places on those platforms:  an
EMIT_BARRIER in intel_engine_emit_ctx_wa (which happens during device
init, before we've had an opportunity to use CCS for anything) and the
new flush we're adding here in aux invalidation.  All other uses of
EMIT_FLUSH in the driver are specific to non-GuC execlist submission or
to the old ringbuffer-based submission on pre-gen8 platforms.

Anyway, adding the extra condition shouldn't really hurt anything
either, so up to you whether you want to drop it or not.

Reviewed-by: Matt Roper <matthew.d.ro...@intel.com>

>  
>       if (mode & EMIT_FLUSH) {
>               u32 bit_group_0 = 0;
> @@ -221,6 +221,14 @@ int gen12_emit_flush_rcs(struct i915_request *rq, u32 
> mode)
>  
>               bit_group_0 |= PIPE_CONTROL0_HDC_PIPELINE_FLUSH;
>  
> +             /*
> +              * When required, in MTL+ platforms we need to
> +              * set the CCS_FLUSH bit in the pipe control
> +              */
> +             if (mode & EMIT_CCS_FLUSH &&
> +                 GRAPHICS_VER_FULL(rq->i915) >= IP_VER(12, 70))
> +                     bit_group_0 |= PIPE_CONTROL_CCS_FLUSH;
> +
>               bit_group_1 |= PIPE_CONTROL_TILE_CACHE_FLUSH;
>               bit_group_1 |= PIPE_CONTROL_FLUSH_L3;
>               bit_group_1 |= PIPE_CONTROL_RENDER_TARGET_CACHE_FLUSH;
> diff --git a/drivers/gpu/drm/i915/gt/intel_engine_types.h 
> b/drivers/gpu/drm/i915/gt/intel_engine_types.h
> index e99a6fa03d453..e2cae9d02bd62 100644
> --- a/drivers/gpu/drm/i915/gt/intel_engine_types.h
> +++ b/drivers/gpu/drm/i915/gt/intel_engine_types.h
> @@ -514,6 +514,7 @@ struct intel_engine_cs {
>       int             (*emit_flush)(struct i915_request *request, u32 mode);
>  #define EMIT_INVALIDATE      BIT(0)
>  #define EMIT_FLUSH   BIT(1)
> +#define EMIT_CCS_FLUSH       BIT(2) /* MTL+ */
>  #define EMIT_BARRIER (EMIT_INVALIDATE | EMIT_FLUSH)
>       int             (*emit_bb_start)(struct i915_request *rq,
>                                        u64 offset, u32 length,
> diff --git a/drivers/gpu/drm/i915/gt/intel_gpu_commands.h 
> b/drivers/gpu/drm/i915/gt/intel_gpu_commands.h
> index 5d143e2a8db03..5df7cce23197c 100644
> --- a/drivers/gpu/drm/i915/gt/intel_gpu_commands.h
> +++ b/drivers/gpu/drm/i915/gt/intel_gpu_commands.h
> @@ -299,6 +299,7 @@
>  #define   PIPE_CONTROL_QW_WRITE                              (1<<14)
>  #define   PIPE_CONTROL_POST_SYNC_OP_MASK                (3<<14)
>  #define   PIPE_CONTROL_DEPTH_STALL                   (1<<13)
> +#define   PIPE_CONTROL_CCS_FLUSH                     (1<<13) /* MTL+ */
>  #define   PIPE_CONTROL_WRITE_FLUSH                   (1<<12)
>  #define   PIPE_CONTROL_RENDER_TARGET_CACHE_FLUSH     (1<<12) /* gen6+ */
>  #define   PIPE_CONTROL_INSTRUCTION_CACHE_INVALIDATE  (1<<11) /* MBZ on ILK */
> -- 
> 2.40.1
> 

-- 
Matt Roper
Graphics Software Engineer
Linux GPU Platform Enablement
Intel Corporation

Reply via email to