Re: Does gbm_bo_map() implicitly synchronise?

Christian König Tue, 25 Jun 2024 00:40:46 -0700

Am 24.06.24 um 21:08 schrieb James Jones:

FWIW, the NVIDIA binary driver's implementation of gbm_bo_map/unmap()
1) Don't do any synchronization against in-flight work. The assumptionis that if the content is going to be read, the API writing the datahas established that coherence. Likewise, if it's going to be written,the API reading it afterwards does any invalidates or whatever areneeded for coherence.

That matches my assumption of what this function does, but is just theopposite of what Michel explained what it does.

Is it somewhere documented if gbm_bo_map() should wait for in-flightwork or not?


Regards,
Christian.

2) We don't blit anything or format convert, because our GBMimplementation has no DMA engine access, and I'd like to keep it thatway. Setting up a DMA-capable driver instance is much more expensiveas far as runtime resources than setting up a simple allocator+mmapdriver, at least in our driver architecture. Our GBM map just does anmmap(), and if it's not linear, you're not going to be able tointerpret the data unless you've read up on our tiling formats. I'maware this is different from Mesa, and no one has complained thus far.If we were forced to fix it, I imagine we'd do something like ask ashared engine in the kernel to do the blit on userspace's behalf,which would probably be slow but save resources.
Basically, don't use gbm_bo_map() for anything non-trivial on ourimplementation. It's not the right tool for e.g., reading back orpopulating OpenGL textures or X pixmaps. If you don't want to run onthe NV implementation, feel free to ignore this advice, but I'd stillsuggest it's not the best tool for most jobs.
Thanks,
-James

On 6/17/24 03:29, Pierre Ossman wrote:
On 17/06/2024 10:13, Christian König wrote:
Let me try to clarify a couple of things:
The DMA_BUF_IOCTL_SYNC function is to flush and invalidate caches sothat the GPU can see values written by the CPU and the CPU can seevalues written by the GPU. But that IOCTL does *not* wait for anyasync GPU operation to finish.
If you want to wait for async GPU operations you either need to callthe OpenGL functions to read pixels or do a select() (or poll, epolletc...) call on the DMA-buf file descriptor.
Thanks for the clarification!
Just to avoid any uncertainty, are both of these things doneimplicitly by gbm_bo_map()/gbm_bo_unmap()?
I did test adding those steps just in case, but unfortunately did notsee an improvement. My order was:
1. gbm_bo_import(GBM_BO_USE_RENDERING)
2. gbm_bo_get_fd()
3. Wait for client to request displaying the buffer
4. gbm_bo_map(GBM_BO_TRANSFER_READ)
5. select(fd+1, &fds, NULL, NULL, NULL)
6. ioctl(DMA_BUF_IOCTL_SYNC, &{ .flags = DMA_BUF_SYNC_START |DMA_BUF_SYNC_READ })
7. pixman_blt()
8. gbm_bo_unmap()
So if you want to do some rendering with OpenGL and then see theresult in a buffer memory mapping the correct sequence would be thefollowing:
1. Issue OpenGL rendering commands.
2. Call glFlush() to make sure the hw actually starts working on therendering.3. Call select() on the DMA-buf file descriptor to wait for therendering to complete.
4. Use DMA_BUF_IOCTL_SYNC to make the rendering result CPU visible.
What I want to do is implement the X server side of DRI3 in just CPU.It works for every application I've tested except gnome-shell.
I would assume that 1. and 2. are supposed to be done by the Xclient, i.e. gnome-shell?
What I need to be able to do is access the result of that, once the Xclient tries to draw using that GBM backed pixmap (e.g. usingPresentPixmap).
So far, we've only tested Intel GPUs, but we are setting up Nvidiaand AMD GPUs at the moment. It will be interesting to see if theissue remains on those or not.
Regards

Re: Does gbm_bo_map() implicitly synchronise?

Reply via email to