Hello Dmitry,

thanks for reviewing!

On Mon, 23 Jun 2025 14:06:43 +0300
Dmitry Baryshkov <dmitry.barysh...@oss.qualcomm.com> wrote:

> On 20/06/2025 18:59, Luca Ceresoli wrote:
> > To the best of my knowledge, all drivers in the mainline kernel adding a
> > DRM bridge are now converted to using devm_drm_bridge_alloc() for
> > allocation and initialization. Among others this ensures initialization of
> > the bridge refcount, allowing dynamic allocation lifetime.
> > 
> > devm_drm_bridge_alloc() is now mandatory for all new bridges. Code using
> > the old pattern ([devm_]kzalloc + filling the struct fields +
> > drm_bridge_add) is not allowed anymore.
> > 
> > Any drivers that might have been missed during the conversion, patches in
> > flight towards mainline and out-of-tre drivers still using the old pattern
> > will already be caught by a warning looking like:
> > 
> >    ------------[ cut here ]------------
> >    refcount_t: addition on 0; use-after-free.
> >    WARNING: CPU: 2 PID: 83 at lib/refcount.c:25 
> > refcount_warn_saturate+0x120/0x148
> >    [...]
> >    Call trace:
> >     refcount_warn_saturate+0x120/0x148 (P)
> >     drm_bridge_get.part.0+0x70/0x98 [drm]
> >     drm_bridge_add+0x34/0x108 [drm]
> >     sn65dsi83_probe+0x200/0x480 [ti_sn65dsi83]
> >     [...]
> > 
> > This warning comes from the refcount code and happens because
> > drm_bridge_add() is increasing the refcount, which is uninitialized and
> > thus initially zero.
> > 
> > Having a warning and the corresponding stack trace is surely useful, but
> > the warning text does not clarify the root problem nor how to fix it.
> > 
> > Add a DRM_WARN() just before increasing the refcount, so the log will be
> > much more readable:
> > 
> >    [drm] DRM bridge corrupted or not allocated by devm_drm_bridge_alloc()
> >    ------------[ cut here ]------------
> >    refcount_t: addition on 0; use-after-free.
> >    [...etc...]
> > 
> > A DRM_WARN is used because drm_warn and drm_WARN require a struct
> > drm_device pointer which is not yet available when adding a bridge.
> > 
> > Do not print the dev_name() in the warning because struct drm_bridge has no
> > pointer to the struct device. The affected driver should be easy to catch
> > based on the following stack trace however.
> > 
> > Signed-off-by: Luca Ceresoli <luca.ceres...@bootlin.com>
> > 
> > ---
> > 
> > Changes in v9:
> > - change warning trigger from "refcount != 1" to "container not NULL"
> > 
> > This patch was added in v8
> > ---
> >   drivers/gpu/drm/drm_bridge.c | 3 +++
> >   1 file changed, 3 insertions(+)
> > 
> > diff --git a/drivers/gpu/drm/drm_bridge.c b/drivers/gpu/drm/drm_bridge.c
> > index 
> > f001bbe95559aabf0aac9f25f89250ad4e1ad9c8..0b450b334afd82e0460f18fdd248f79d0a2b153d
> >  100644
> > --- a/drivers/gpu/drm/drm_bridge.c
> > +++ b/drivers/gpu/drm/drm_bridge.c
> > @@ -295,6 +295,9 @@ EXPORT_SYMBOL(__devm_drm_bridge_alloc);
> >    */
> >   void drm_bridge_add(struct drm_bridge *bridge)  
> 
> Can we rename this function, making sure that noone can call it 
> directly? E.g. adding two underscores might point out that is is an 
> internal API.

I'm afraid I don't think this would be correct. Every bridge driver is
expected to call drm_bridge_add() at the end of probe/initialization,
to make the DRM subsystem aware that this bridge is ready for being
used.

The point of this patch, which is a completion to [1], is to ensure
that all drivers use the "new" process:

 1. bridge = devm_drm_bridge_alloc()
 2. drm_bridge_add(bridge)

and there are no users of the old process:

 1. bridge = [devm_]kzalloc()
 2. drm_bridge_add(bridge)

Does this addresses your concern?

[1] 
https://cgit.freedesktop.org/drm/drm-misc/commit/?id=b3f13e00a8de351832df7d628a15ca4db49ca94f

Luca

-- 
Luca Ceresoli, Bootlin
Embedded Linux and Kernel engineering
https://bootlin.com

Reply via email to