Hi Mikhail,
> Subject: [PATCH] dma-buf/udmabuf: skip redundant cpu sync to fix
> cacheline EEXIST warning
>
> When CONFIG_DMA_API_DEBUG_SG is enabled, importing a udmabuf
> into a DRM
> driver (e.g. amdgpu for video playback in GNOME Videos / Showtime)
> triggers a spurious warning:
>
> DMA-API: amdgpu 0000:03:00.0: cacheline tracking EEXIST, \
> overlapping mappings aren't supported
> WARNING: kernel/dma/debug.c:619 at add_dma_entry+0x473/0x5f0
>
> The call chain is:
>
> amdgpu_cs_ioctl
> -> amdgpu_ttm_backend_bind
> -> dma_buf_map_attachment
> -> [udmabuf] map_udmabuf -> get_sg_table
> -> dma_map_sgtable(dev, sg, direction, 0) // attrs=0
> -> debug_dma_map_sg -> add_dma_entry -> EEXIST
>
> This happens because udmabuf builds a per-page scatter-gather list via
> sg_set_folio(). When begin_cpu_udmabuf() has already created an sg
> table mapped for the misc device, and an importer such as amdgpu
> maps
> the same pages for its own device via map_udmabuf(), the DMA debug
> infrastructure sees two active mappings whose physical addresses share
> cacheline boundaries and warns about the overlap.
>
> The DMA_ATTR_SKIP_CPU_SYNC flag suppresses this check in
> add_dma_entry() because it signals that no CPU cache maintenance is
> performed at map/unmap time, making the cacheline overlap harmless.
>
> All other major dma-buf exporters already pass this flag:
> - drm_gem_map_dma_buf() passes DMA_ATTR_SKIP_CPU_SYNC
> - amdgpu_dma_buf_map() passes DMA_ATTR_SKIP_CPU_SYNC
>
> The CPU sync at map/unmap time is also redundant for udmabuf:
> begin_cpu_udmabuf() and end_cpu_udmabuf() already perform explicit
> cache synchronization via dma_sync_sgtable_for_cpu/device() when
> CPU
> access is requested through the dma-buf interface.
>
> Pass DMA_ATTR_SKIP_CPU_SYNC to dma_map_sgtable() and
> dma_unmap_sgtable() in udmabuf to suppress the spurious warning
> and
> skip the redundant sync.
>
> Fixes: 284562e1f348 ("udmabuf: implement
> begin_cpu_access/end_cpu_access hooks")
> Cc: [email protected]
> Signed-off-by: Mikhail Gavrilov <[email protected]>
> ---
> drivers/dma-buf/udmabuf.c | 4 ++--
> 1 file changed, 2 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/dma-buf/udmabuf.c b/drivers/dma-buf/udmabuf.c
> index 94b8ecb892bb..9c6f8785a28a 100644
> --- a/drivers/dma-buf/udmabuf.c
> +++ b/drivers/dma-buf/udmabuf.c
> @@ -162,7 +162,7 @@ static struct sg_table *get_sg_table(struct device
> *dev, struct dma_buf *buf,
> sg_set_folio(sgl, ubuf->folios[i], PAGE_SIZE,
> ubuf->offsets[i]);
>
> - ret = dma_map_sgtable(dev, sg, direction, 0);
> + ret = dma_map_sgtable(dev, sg, direction,
> DMA_ATTR_SKIP_CPU_SYNC);
> if (ret < 0)
> goto err_map;
> return sg;
> @@ -177,7 +177,7 @@ static struct sg_table *get_sg_table(struct device
> *dev, struct dma_buf *buf,
> static void put_sg_table(struct device *dev, struct sg_table *sg,
> enum dma_data_direction direction)
> {
> - dma_unmap_sgtable(dev, sg, direction, 0);
> + dma_unmap_sgtable(dev, sg, direction,
> DMA_ATTR_SKIP_CPU_SYNC);
Looks OK to me but it would be nice if Christian or someone else can
provide an Ack for this patch.
Thanks,
Vivek
> sg_free_table(sg);
> kfree(sg);
> }
> --
> 2.53.0