On Tue, 2026-06-02 at 16:33 +0100, Tvrtko Ursulin wrote:
> Commit
> 28c5bf28763d ("drm/sched: Disallow initializing entities with no schedulers")
> failed to notice clearing of entity->rq in drm_sched_entity_init() is now

By "clearing", do you mean the initialization to NULL?

I'd just use "initialization" consistently, like in the commit title.

> redundant and can be removed.
> 
> Given that entity->rq can now never be NULL, we also remove two impossible
> checks, from drm_sched_entity_kill() and drm_sched_entity_flush()
> respectively.
> 
> Similarly, we can also remove the !entity->rq check in
> drm_sched_job_init(). And for the better, given that the error message, if
> it ever triggered, would have dereferenced the yet un-initialized
> job->sched (only initialized later in drm_sched_job_arm()). This appears to
> have been theoretically broken ever since commit
> 56e449603f0a ("drm/sched: Convert the GPU scheduler to variable number of run-queues").
> 
> Signed-off-by: Tvrtko Ursulin <[email protected]>
> Cc: Christian König <[email protected]>
> Cc: Danilo Krummrich <[email protected]>
> Cc: Matthew Brost <[email protected]>
> Cc: Philipp Stanner <[email protected]>
> ---
>  drivers/gpu/drm/scheduler/sched_entity.c | 11 ++---------
>  drivers/gpu/drm/scheduler/sched_main.c   |  9 ---------
>  2 files changed, 2 insertions(+), 18 deletions(-)
> 
> diff --git a/drivers/gpu/drm/scheduler/sched_entity.c b/drivers/gpu/drm/scheduler/sched_entity.c
> index 4ebb513255ed..c51101ec70c1 100644
> --- a/drivers/gpu/drm/scheduler/sched_entity.c
> +++ b/drivers/gpu/drm/scheduler/sched_entity.c
> @@ -129,7 +129,6 @@ int drm_sched_entity_init(struct drm_sched_entity *entity,
>               return -ENOMEM;
>  
>       INIT_LIST_HEAD(&entity->list);
> -     entity->rq = NULL;

It seems this has always been redundant because of the memset() to 0
directly above.
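
To illustrate the memset() point, here is a minimal userspace sketch. The
names struct demo_rq, struct demo_entity and demo_entity_init() are
hypothetical stand-ins, not the real scheduler types. It also assumes that an
all-bits-zero pointer compares equal to NULL, which ISO C does not strictly
guarantee but which holds on the architectures Linux supports:

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-ins for illustration only, not the drm_sched types. */
struct demo_rq {
	int unused;
};

struct demo_entity {
	struct demo_rq *rq;
	int priority;
};

/* Mirrors the pattern under discussion: zero the whole struct first. */
static void demo_entity_init(struct demo_entity *entity, int priority)
{
	memset(entity, 0, sizeof(*entity));
	/* An explicit "entity->rq = NULL;" here would change nothing. */
	entity->priority = priority;
}

/* Starts from a non-NULL rq and returns the priority set by init. */
static int demo_init_clears_rq(void)
{
	struct demo_rq rq = { 0 };
	struct demo_entity entity = { .rq = &rq, .priority = -1 };

	demo_entity_init(&entity, 3);
	assert(entity.rq == NULL); /* cleared by memset() alone */
	return entity.priority;
}
```

So the explicit assignment only restated what the memset() had already done.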

>       entity->guilty = guilty;
>       entity->priority = priority;
>       entity->last_user = current->group_leader;
> @@ -280,9 +279,6 @@ void drm_sched_entity_kill(struct drm_sched_entity *entity)
>       struct drm_sched_job *job;
>       struct dma_fence *prev;
>  
> -     if (!entity->rq)
> -             return;
> -
>       spin_lock(&entity->lock);
>       entity->stopped = true;
>       drm_sched_rq_remove_entity(entity->rq, entity);
> @@ -329,14 +325,11 @@ EXPORT_SYMBOL(drm_sched_entity_kill);
>   */
>  long drm_sched_entity_flush(struct drm_sched_entity *entity, long timeout)
>  {
> -     struct drm_gpu_scheduler *sched;
> +     struct drm_gpu_scheduler *sched =
> +             container_of(entity->rq, typeof(*sched), rq);
>       struct task_struct *last_user;
>       long ret = timeout;
>  
> -     if (!entity->rq)
> -             return 0;
> -
> -     sched = container_of(entity->rq, typeof(*sched), rq);
>       /*
>        * The client will not queue more jobs during this fini - consume
>        * existing queued ones, or discard them on SIGKILL.
> diff --git a/drivers/gpu/drm/scheduler/sched_main.c b/drivers/gpu/drm/scheduler/sched_main.c
> index 818d3d4434b5..d2ca01b31ee4 100644
> --- a/drivers/gpu/drm/scheduler/sched_main.c
> +++ b/drivers/gpu/drm/scheduler/sched_main.c
> @@ -588,15 +588,6 @@ int drm_sched_job_init(struct drm_sched_job *job,
>                      u32 credits, void *owner,
>                      uint64_t drm_client_id)
>  {
> -     if (!entity->rq) {
> -             /* This will most likely be followed by missing frames
> -              * or worse--a blank screen--leave a trail in the
> -              * logs, so this can be debugged easier.
> -              */
> -             dev_err(job->sched->dev, "%s: entity has no rq!\n", __func__);
> -             return -ENOENT;
> -     }
> -
>       if (unlikely(!credits)) {
>               pr_err("*ERROR* %s: credits cannot be 0!\n", __func__);
>               return -EINVAL;

But overall a very nice cleanup.


P.
