Re: [Intel-gfx] [PATCH 12/20] drm/i915/icl: Check for fused-off VDBOX and VEBOX instances
On 17/02/18 06:17, Sagar Arun Kamble wrote: On 2/17/2018 5:48 PM, Chris Wilson wrote: Quoting Sagar Arun Kamble (2018-02-17 12:10:32) On 2/17/2018 2:34 PM, Chris Wilson wrote: Quoting Sagar Arun Kamble (2018-02-17 08:51:44) Earlier I had thought of calling ASSIGN_FW_DOMAINS_TABLE, ASSIGN_*_MMIO_VFUNCS before intel_uncore_fw_domains_init and use I915_READ here for reading the fuse. But that approach seems to expose the vfuncs and forcewake table before fw domains are initialized. Although we can get to know invalid access to read/write accessors before fw_domains get initialized, current ordering of fw_domains init followed by fw domain table/read/write vfuncs init seems right. So using fw_domains_get/put as suggested above should be the way. Chris, Tvrtko, Mika, do you agree? What's the complication with the fw_domains? Do the additional powerwell depend on the fused status of the extra engines? Does it matter if the fw_domain are prepped if they are never used? Yes. To discover the available VD/VE engines/power domains, fuse needs to be read under blitter forcewake as RC6 will be enabled by BIOS. We do have usage of forcewake in IVB to discover FORCEWAKE_MT availability in fw_domains_init. It should not be a problem if they are prepped but never used. Imo, I would have placed the fused discovery in intel_engines_init_mmio() (where we do the setup and can take forcewake). Then adding something like intel_uncore_reinit_mmio() (which would just prune the uncore->fw_domains) after checking fused status with commentary doesn't seem that horrible. Yes. This approach looks good too. But, we might want to optimize the driver_load to avoid this setup at first place instead of pruning later. How many cycles does it take to run through all domains and set up the register offsets? No mmio access required right, we are just moving memory around without even hitting locked instructions? Yes. Latency is minimal. We can go with your suggestion. Will need to maintain separation of engine_cs/device_info/uncore update through separate functions. We do currently call fw_domain_reset on all domains at initialization time, which could write to a register that doesn't exist. We don't wait for the ack and I assume the write would just be dropped if the power well is not there so it shouldn't be an issue, but it might be worth adding a comment in fw_domain_reset to remind us that we can't start waiting for the ack in there unless we move the fuse read to an earlier point. Daniele Another related setup that can be avoided/pruned is the fw_range for the engine fused off as it can improve the fw lookup. Which is a bsearch on register range, I doubt that's going to be substantially impacted by removing a few ranges. Where it matters, we should be looking to precalculate the result anyway. Yes. seems not so worth. -Chris ___ Intel-gfx mailing list Intel-gfx@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/intel-gfx
Re: [Intel-gfx] [PATCH 12/20] drm/i915/icl: Check for fused-off VDBOX and VEBOX instances
On 2/17/2018 5:48 PM, Chris Wilson wrote: Quoting Sagar Arun Kamble (2018-02-17 12:10:32) On 2/17/2018 2:34 PM, Chris Wilson wrote: Quoting Sagar Arun Kamble (2018-02-17 08:51:44) Earlier I had thought of calling ASSIGN_FW_DOMAINS_TABLE, ASSIGN_*_MMIO_VFUNCS before intel_uncore_fw_domains_init and use I915_READ here for reading the fuse. But that approach seems to expose the vfuncs and forcewake table before fw domains are initialized. Although we can get to know invalid access to read/write accessors before fw_domains get initialized, current ordering of fw_domains init followed by fw domain table/read/write vfuncs init seems right. So using fw_domains_get/put as suggested above should be the way. Chris, Tvrtko, Mika, do you agree? What's the complication with the fw_domains? Do the additional powerwell depend on the fused status of the extra engines? Does it matter if the fw_domain are prepped if they are never used? Yes. To discover the available VD/VE engines/power domains, fuse needs to be read under blitter forcewake as RC6 will be enabled by BIOS. We do have usage of forcewake in IVB to discover FORCEWAKE_MT availability in fw_domains_init. It should not be a problem if they are prepped but never used. Imo, I would have placed the fused discovery in intel_engines_init_mmio() (where we do the setup and can take forcewake). Then adding something like intel_uncore_reinit_mmio() (which would just prune the uncore->fw_domains) after checking fused status with commentary doesn't seem that horrible. Yes. This approach looks good too. But, we might want to optimize the driver_load to avoid this setup at first place instead of pruning later. How many cycles does it take to run through all domains and set up the register offsets? No mmio access required right, we are just moving memory around without even hitting locked instructions? Yes. Latency is minimal. We can go with your suggestion. Will need to maintain separation of engine_cs/device_info/uncore update through separate functions. Another related setup that can be avoided/pruned is the fw_range for the engine fused off as it can improve the fw lookup. Which is a bsearch on register range, I doubt that's going to be substantially impacted by removing a few ranges. Where it matters, we should be looking to precalculate the result anyway. Yes. seems not so worth. -Chris -- Thanks, Sagar ___ Intel-gfx mailing list Intel-gfx@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/intel-gfx
Re: [Intel-gfx] [PATCH 12/20] drm/i915/icl: Check for fused-off VDBOX and VEBOX instances
Quoting Sagar Arun Kamble (2018-02-17 12:10:32) > > > On 2/17/2018 2:34 PM, Chris Wilson wrote: > > Quoting Sagar Arun Kamble (2018-02-17 08:51:44) > >> Earlier I had thought of calling ASSIGN_FW_DOMAINS_TABLE, > >> ASSIGN_*_MMIO_VFUNCS before intel_uncore_fw_domains_init > >> and use I915_READ here for reading the fuse. But that approach seems to > >> expose the vfuncs and forcewake table before fw domains > >> are initialized. Although we can get to know invalid access to > >> read/write accessors before fw_domains get initialized, current ordering > >> of fw_domains init followed by fw domain table/read/write vfuncs init > >> seems right. So using fw_domains_get/put as suggested above > >> should be the way. > >> > >> Chris, Tvrtko, Mika, do you agree? > > What's the complication with the fw_domains? Do the additional powerwell > > depend on the fused status of the extra engines? Does it matter if the > > fw_domain are prepped if they are never used? > Yes. To discover the available VD/VE engines/power domains, fuse needs > to be read under blitter forcewake as > RC6 will be enabled by BIOS. We do have usage of forcewake in IVB to > discover FORCEWAKE_MT availability in fw_domains_init. > It should not be a problem if they are prepped but never used. > > Imo, I would have placed the fused discovery in > > intel_engines_init_mmio() (where we do the setup and can take forcewake). > > Then adding something like intel_uncore_reinit_mmio() (which would just > > prune the uncore->fw_domains) after checking fused status with commentary > > doesn't seem that horrible. > Yes. This approach looks good too. But, we might want to optimize the > driver_load to avoid this setup at first place > instead of pruning later. How many cycles does it take to run through all domains and set up the register offsets? No mmio access required right, we are just moving memory around without even hitting locked instructions? > Another related setup that can be avoided/pruned is the fw_range for the > engine fused off as it can improve the fw lookup. Which is a bsearch on register range, I doubt that's going to be substantially impacted by removing a few ranges. Where it matters, we should be looking to precalculate the result anyway. -Chris ___ Intel-gfx mailing list Intel-gfx@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/intel-gfx
Re: [Intel-gfx] [PATCH 12/20] drm/i915/icl: Check for fused-off VDBOX and VEBOX instances
On 2/17/2018 2:34 PM, Chris Wilson wrote: Quoting Sagar Arun Kamble (2018-02-17 08:51:44) Earlier I had thought of calling ASSIGN_FW_DOMAINS_TABLE, ASSIGN_*_MMIO_VFUNCS before intel_uncore_fw_domains_init and use I915_READ here for reading the fuse. But that approach seems to expose the vfuncs and forcewake table before fw domains are initialized. Although we can get to know invalid access to read/write accessors before fw_domains get initialized, current ordering of fw_domains init followed by fw domain table/read/write vfuncs init seems right. So using fw_domains_get/put as suggested above should be the way. Chris, Tvrtko, Mika, do you agree? What's the complication with the fw_domains? Do the additional powerwell depend on the fused status of the extra engines? Does it matter if the fw_domain are prepped if they are never used? Yes. To discover the available VD/VE engines/power domains, fuse needs to be read under blitter forcewake as RC6 will be enabled by BIOS. We do have usage of forcewake in IVB to discover FORCEWAKE_MT availability in fw_domains_init. It should not be a problem if they are prepped but never used. Imo, I would have placed the fused discovery in intel_engines_init_mmio() (where we do the setup and can take forcewake). Then adding something like intel_uncore_reinit_mmio() (which would just prune the uncore->fw_domains) after checking fused status with commentary doesn't seem that horrible. Yes. This approach looks good too. But, we might want to optimize the driver_load to avoid this setup at first place instead of pruning later. Another related setup that can be avoided/pruned is the fw_range for the engine fused off as it can improve the fw lookup. -Chris -- Thanks, Sagar ___ Intel-gfx mailing list Intel-gfx@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/intel-gfx
Re: [Intel-gfx] [PATCH 12/20] drm/i915/icl: Check for fused-off VDBOX and VEBOX instances
Quoting Sagar Arun Kamble (2018-02-17 08:51:44) > Earlier I had thought of calling ASSIGN_FW_DOMAINS_TABLE, > ASSIGN_*_MMIO_VFUNCS before intel_uncore_fw_domains_init > and use I915_READ here for reading the fuse. But that approach seems to > expose the vfuncs and forcewake table before fw domains > are initialized. Although we can get to know invalid access to > read/write accessors before fw_domains get initialized, current ordering > of fw_domains init followed by fw domain table/read/write vfuncs init > seems right. So using fw_domains_get/put as suggested above > should be the way. > > Chris, Tvrtko, Mika, do you agree? What's the complication with the fw_domains? Do the additional powerwell depend on the fused status of the extra engines? Does it matter if the fw_domain are prepped if they are never used? Imo, I would have placed the fused discovery in intel_engines_init_mmio() (where we do the setup and can take forcewake). Then adding something like intel_uncore_reinit_mmio() (which would just prune the uncore->fw_domains) after checking fused status with commentary doesn't seem that horrible. -Chris ___ Intel-gfx mailing list Intel-gfx@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/intel-gfx
Re: [Intel-gfx] [PATCH 12/20] drm/i915/icl: Check for fused-off VDBOX and VEBOX instances
On 2/13/2018 10:07 PM, Mika Kuoppala wrote: From: Oscar MateoIn Gen11, the Video Decode engines (aka VDBOX, aka VCS, aka BSD) and the Video Enhancement engines (aka VEBOX, aka VECS) could be fused off. Also, each VDBOX and VEBOX has its own power well, which only exist if the related engine exists in the HW. Unfortunately, we have a Catch-22 situation going on: we need to read an MMIO register with the fuse info, but we cannot fully enable MMIO until we read it (since we need the real engines to initialize the forcewake domains). We need to ensure BLITTER is initialized first and use low level functions fw_domains_get/put() around raw read to know these engines status. We workaround this problem by reading the fuse after the MMIO is partially ready, but before we initialize forcewake. Bspec: 20680 v2: We were shifting incorrectly for vebox disable (Vinay) v3: Assert mmio is ready and warn if we have attempted to initialize forcewake for fused-off engines (Paulo) v4: - Use INTEL_GEN in new code (Tvrtko) - Shorter local variable (Tvrtko, Michal) - Keep "if (!...) continue" style (Tvrtko) - No unnecessary BUG_ON (Tvrtko) - WARN_ON and cleanup if wrong mask (Tvrtko, Michal) - Use I915_READ_FW (Michal) - Use I915_MAX_VCS/VECS macros (Michal) v5: Rebased by Rodrigo fixing conflicts on top of: commit 33def1ff7b0 ("drm/i915: Simplify intel_engines_init") v6: Fix v5. Remove info->num_rings. (by Oscar) v7: Rebase (Rodrigo). v8: - s/intel_device_info_fused_off_engines/intel_device_info_init_mmio (Chris) - Make vdbox_disable & vebox_disable local variables (Chris) Cc: Paulo Zanoni Cc: Vinay Belgaumkar Cc: Tvrtko Ursulin Cc: Michal Wajdeczko Cc: Chris Wilson Signed-off-by: Rodrigo Vivi Signed-off-by: Oscar Mateo --- drivers/gpu/drm/i915/i915_drv.c | 2 ++ drivers/gpu/drm/i915/i915_drv.h | 1 + drivers/gpu/drm/i915/i915_reg.h | 5 +++ drivers/gpu/drm/i915/intel_device_info.c | 54 4 files changed, 62 insertions(+) diff --git a/drivers/gpu/drm/i915/i915_drv.c b/drivers/gpu/drm/i915/i915_drv.c index 9380c9f69b0f..43b2f620bca7 100644 --- a/drivers/gpu/drm/i915/i915_drv.c +++ b/drivers/gpu/drm/i915/i915_drv.c @@ -1033,6 +1033,8 @@ static int i915_driver_init_mmio(struct drm_i915_private *dev_priv) if (ret < 0) goto err_bridge; + intel_device_info_init_mmio(dev_priv); This should be called during intel_uncore_fw_domains_init after fw_domain_init(.., FW_DOMAIN_ID_BLITTER, ..); + intel_uncore_init(dev_priv); intel_uc_init_mmio(dev_priv); diff --git a/drivers/gpu/drm/i915/i915_drv.h b/drivers/gpu/drm/i915/i915_drv.h index 65e674668b2e..ba16c2025364 100644 --- a/drivers/gpu/drm/i915/i915_drv.h +++ b/drivers/gpu/drm/i915/i915_drv.h @@ -3438,6 +3438,7 @@ void i915_unreserve_fence(struct drm_i915_fence_reg *fence); void i915_gem_revoke_fences(struct drm_i915_private *dev_priv); void i915_gem_restore_fences(struct drm_i915_private *dev_priv); +void intel_device_info_init_mmio(struct drm_i915_private *dev_priv); void i915_gem_detect_bit_6_swizzle(struct drm_i915_private *dev_priv); void i915_gem_object_do_bit_17_swizzle(struct drm_i915_gem_object *obj, struct sg_table *pages); diff --git a/drivers/gpu/drm/i915/i915_reg.h b/drivers/gpu/drm/i915/i915_reg.h index b6cd725ff0b7..2b8d3a13dd27 100644 --- a/drivers/gpu/drm/i915/i915_reg.h +++ b/drivers/gpu/drm/i915/i915_reg.h @@ -2860,6 +2860,11 @@ enum i915_power_well_id { #define GEN10_EU_DISABLE3 _MMIO(0x9140) #define GEN10_EU_DIS_SS_MASK0xff +#define GEN11_GT_VEBOX_VDBOX_DISABLE _MMIO(0x9140) +#define GEN11_GT_VDBOX_DISABLE_MASK0xff +#define GEN11_GT_VEBOX_DISABLE_SHIFT 16 +#define GEN11_GT_VEBOX_DISABLE_MASK(0xff << GEN11_GT_VEBOX_DISABLE_SHIFT) + #define GEN6_BSD_SLEEP_PSMI_CONTROL _MMIO(0x12050) #define GEN6_BSD_SLEEP_MSG_DISABLE (1 << 0) #define GEN6_BSD_SLEEP_FLUSH_DISABLE(1 << 2) diff --git a/drivers/gpu/drm/i915/intel_device_info.c b/drivers/gpu/drm/i915/intel_device_info.c index 9352f34e75c4..7c8779faf162 100644 --- a/drivers/gpu/drm/i915/intel_device_info.c +++ b/drivers/gpu/drm/i915/intel_device_info.c @@ -595,3 +595,57 @@ void intel_driver_caps_print(const struct intel_driver_caps *caps, { drm_printf(p, "scheduler: %x\n", caps->scheduler); } + +/* + * Determine which engines are fused off in our particular hardware. + * + * This function needs to be called after the MMIO has been setup (as we need + * to read registers) but before uncore init (because the powerwell for the + * fused off engines doesn't exist, so we cannot initialize forcewake for them) + */
Re: [Intel-gfx] [PATCH 12/20] drm/i915/icl: Check for fused-off VDBOX and VEBOX instances
On Tue, 13 Feb 2018 17:37:30 +0100, Mika Kuoppalawrote: From: Oscar Mateo In Gen11, the Video Decode engines (aka VDBOX, aka VCS, aka BSD) and the Video Enhancement engines (aka VEBOX, aka VECS) could be fused off. Also, each VDBOX and VEBOX has its own power well, which only exist if the related engine exists in the HW. Unfortunately, we have a Catch-22 situation going on: we need to read an MMIO register with the fuse info, but we cannot fully enable MMIO until we read it (since we need the real engines to initialize the forcewake domains). We workaround this problem by reading the fuse after the MMIO is partially ready, but before we initialize forcewake. Bspec: 20680 v2: We were shifting incorrectly for vebox disable (Vinay) v3: Assert mmio is ready and warn if we have attempted to initialize forcewake for fused-off engines (Paulo) v4: - Use INTEL_GEN in new code (Tvrtko) - Shorter local variable (Tvrtko, Michal) - Keep "if (!...) continue" style (Tvrtko) - No unnecessary BUG_ON (Tvrtko) - WARN_ON and cleanup if wrong mask (Tvrtko, Michal) - Use I915_READ_FW (Michal) - Use I915_MAX_VCS/VECS macros (Michal) v5: Rebased by Rodrigo fixing conflicts on top of: commit 33def1ff7b0 ("drm/i915: Simplify intel_engines_init") v6: Fix v5. Remove info->num_rings. (by Oscar) v7: Rebase (Rodrigo). v8: - s/intel_device_info_fused_off_engines/intel_device_info_init_mmio (Chris) - Make vdbox_disable & vebox_disable local variables (Chris) Cc: Paulo Zanoni Cc: Vinay Belgaumkar Cc: Tvrtko Ursulin Cc: Michal Wajdeczko Cc: Chris Wilson Signed-off-by: Rodrigo Vivi Signed-off-by: Oscar Mateo --- drivers/gpu/drm/i915/i915_drv.c | 2 ++ drivers/gpu/drm/i915/i915_drv.h | 1 + drivers/gpu/drm/i915/i915_reg.h | 5 +++ drivers/gpu/drm/i915/intel_device_info.c | 54 4 files changed, 62 insertions(+) diff --git a/drivers/gpu/drm/i915/i915_drv.c b/drivers/gpu/drm/i915/i915_drv.c index 9380c9f69b0f..43b2f620bca7 100644 --- a/drivers/gpu/drm/i915/i915_drv.c +++ b/drivers/gpu/drm/i915/i915_drv.c @@ -1033,6 +1033,8 @@ static int i915_driver_init_mmio(struct drm_i915_private *dev_priv) if (ret < 0) goto err_bridge; + intel_device_info_init_mmio(dev_priv); + intel_uncore_init(dev_priv); intel_uc_init_mmio(dev_priv); diff --git a/drivers/gpu/drm/i915/i915_drv.h b/drivers/gpu/drm/i915/i915_drv.h index 65e674668b2e..ba16c2025364 100644 --- a/drivers/gpu/drm/i915/i915_drv.h +++ b/drivers/gpu/drm/i915/i915_drv.h @@ -3438,6 +3438,7 @@ void i915_unreserve_fence(struct drm_i915_fence_reg *fence); void i915_gem_revoke_fences(struct drm_i915_private *dev_priv); void i915_gem_restore_fences(struct drm_i915_private *dev_priv); +void intel_device_info_init_mmio(struct drm_i915_private *dev_priv); This function should be declared in "intel_device_info.h" void i915_gem_detect_bit_6_swizzle(struct drm_i915_private *dev_priv); void i915_gem_object_do_bit_17_swizzle(struct drm_i915_gem_object *obj, struct sg_table *pages); diff --git a/drivers/gpu/drm/i915/i915_reg.h b/drivers/gpu/drm/i915/i915_reg.h index b6cd725ff0b7..2b8d3a13dd27 100644 --- a/drivers/gpu/drm/i915/i915_reg.h +++ b/drivers/gpu/drm/i915/i915_reg.h @@ -2860,6 +2860,11 @@ enum i915_power_well_id { #define GEN10_EU_DISABLE3 _MMIO(0x9140) #define GEN10_EU_DIS_SS_MASK 0xff +#define GEN11_GT_VEBOX_VDBOX_DISABLE _MMIO(0x9140) +#define GEN11_GT_VDBOX_DISABLE_MASK0xff +#define GEN11_GT_VEBOX_DISABLE_SHIFT 16 +#define GEN11_GT_VEBOX_DISABLE_MASK (0xff << GEN11_GT_VEBOX_DISABLE_SHIFT) + Missing indent (2 spaces) for above bit fields definitions. #define GEN6_BSD_SLEEP_PSMI_CONTROL_MMIO(0x12050) #define GEN6_BSD_SLEEP_MSG_DISABLE (1 << 0) #define GEN6_BSD_SLEEP_FLUSH_DISABLE (1 << 2) diff --git a/drivers/gpu/drm/i915/intel_device_info.c b/drivers/gpu/drm/i915/intel_device_info.c index 9352f34e75c4..7c8779faf162 100644 --- a/drivers/gpu/drm/i915/intel_device_info.c +++ b/drivers/gpu/drm/i915/intel_device_info.c @@ -595,3 +595,57 @@ void intel_driver_caps_print(const struct intel_driver_caps *caps, { drm_printf(p, "scheduler: %x\n", caps->scheduler); } + +/* + * Determine which engines are fused off in our particular hardware. + * + * This function needs to be called after the MMIO has been setup (as we need + * to read registers) but before uncore init (because the powerwell for the + * fused off engines doesn't exist, so we cannot initialize forcewake for them) + */ +void intel_device_info_init_mmio(struct drm_i915_private *dev_priv) +{ + struct