On Wednesday, May 13, 2026 1:26 PM Aaron Kling wrote:
> On Tue, May 12, 2026 at 10:26 PM Mikko Perttunen <[email protected]> 
> wrote:
> >
> > On Tuesday, May 12, 2026 2:29 PM Aaron Kling wrote:
> > > There is an issue with tegra-drm where some buffers get created, then
> > > freed, but the dma buffer never gets freed. Causing display controller
> > > memory allocations to start failing after the leaks fill up cma.
> > >
> > > I created an issue on the freedesktop issue tracker [0] with a patch
> > > with some debug logs I added, then a log from Android that contains
> > > these logs. CMA is set to 512MB, and when allocations start to fail,
> > > the unfreed allocations add up to just shy of 500MB, where it's
> > > reasonable to expect that 8MB contiguous is no longer available. The
> > > log was generated on a Jetson TX2 NX, but I have seen this leak on
> > > other archs as well, this also does not appear to be limited to soc's
> > > with nvdisplay.
> > >
> > > This does not appear to be a userspace issue. The graphics allocator
> > > works as expected for other soc vendors. And as the logs show, the
> > > delete dumb buffer ioctl is called, but is not always followed by the
> > > dma buffer getting freed. I have also observed this issue with a
> > > gralloc that uses the tegra gem create and such, this is not unique to
> > > dumb buffers, that's just the last log I had when deciding to post the
> > > issue to lkml.
> > >
> > > What I primarily intend to ask here is how to further debug this
> > > issue. I'm not finding any direct path between the delete dumb ioctl
> > > handling and gem release or tegra bo free. Can someone point me to the
> > > pieces in the middle I'm missing, where the logic is to decide is a
> > > buffer should be freed?
> > >
> > > Aaron
> > >
> > > [0] https://gitlab.freedesktop.org/drm/tegra/-/work_items/9
> > >
> >
> > If the issue is specific to buffers that get used with display, I have
> > an idea of what the issue is -- there is some circular reference
> > counting with the BO cache in the host1x driver, and that means that
> > BOs that end up in the cache never get released.
> 
> As far as I know, this only affects display controller buffers. Though
> unfortunately, I have limited ways to test the media engines right
> now.

I've been working on some more userspace for the media engines.
Hopefully I can get that in shape soon.

> 
> > Let me do some testing locally and I'll send out a patch once ready.
> 
> Sounds good, thanks.

I posted a fix, please give it a try. Incidentally, on my side I don't
have that much testing set up for the display :)

Mikko

> 
> Aaron



Reply via email to