Re: drm/exynos: g2d userptr memory corruption

2015-08-17 Thread Lucas Stach
Hi Tobias,

Am Sonntag, den 16.08.2015, 14:48 +0200 schrieb Tobias Jakobi:
 Hello,
 
 some time ago I checked whether I could use the userptr functionality to
 do zero-copy from userspace allocated buffers via the G2D. This didn't
 work out so well, so kinda put this to the bottom of my TODO list.
 
 Now that IOMMU support has landed and Jan Kara has rewrote page pinning
 using frame vectors (see [1]) I gave userptr another try.
 
 The results are much better. I'm not experiencing any kernel lockups or
 sysmmu pagefaults anymore. However the image now suffers from visual
 artifacts. These images show the nature of the artifacts:
 http://i.imgur.com/nzT6g3Y.jpg
 http://i.imgur.com/wkuYI6X.jpg
 
 The corruption always manifests itself in these pixel lines of fixed
 size and wrong color.
 
 I have written a testcase as part of libdrm for this issue:
 https://github.com/tobiasjakobi/libdrm/commit/db8bf6844436598251f67a71fc334b929bfb2b71
 
 It allocates N (N an even number) buffers which are aligned to the
 system pagesize. Then it does this each iteration:
 1) Fill the first N/2 buffers with random data
 2) Copy the first half to the second half of the buffers
 3) memcmp() first and second half (verification pass)
 
 Usually this verification already fails on the first iteration. An
 interesting observation is that increasing (!) the buffer size (so the
 amount of pixels that have to copied per buffer grows) makes this issue
 less likely to happen.
 
 With the default 512x512 buffers however it happens, like I said above,
 almost immediately.
 
This is obviously a cache flush missing. The memory you get from
userspace is normal cached memory, so to make it visible to the GPU you
need to flush parts of the cache out to main memory.

The corruption you are seeing is just unflushed cachelines. This also
explains why increasing the buffer size helps: the more memory the CPU
touches the more cachelines will be flushed out to be replaced with new
data.

So you need to go and have a look at dma_map() and dma_sync_*_for_*()
and friends.

Regards,
Lucas
-- 
Pengutronix e.K. | Lucas Stach |
Industrial Linux Solutions   | http://www.pengutronix.de/  |


--
--
___
Dri-devel mailing list
Dri-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dri-devel


Re: drm/exynos: g2d userptr memory corruption

2015-08-17 Thread Tobias Jakobi
Thanks Lucas for the explanation!


Lucas Stach wrote:
 Hi Tobias,
 
 Am Sonntag, den 16.08.2015, 14:48 +0200 schrieb Tobias Jakobi:
 Hello,

 some time ago I checked whether I could use the userptr functionality to
 do zero-copy from userspace allocated buffers via the G2D. This didn't
 work out so well, so kinda put this to the bottom of my TODO list.

 Now that IOMMU support has landed and Jan Kara has rewrote page pinning
 using frame vectors (see [1]) I gave userptr another try.

 The results are much better. I'm not experiencing any kernel lockups or
 sysmmu pagefaults anymore. However the image now suffers from visual
 artifacts. These images show the nature of the artifacts:
 http://i.imgur.com/nzT6g3Y.jpg
 http://i.imgur.com/wkuYI6X.jpg

 The corruption always manifests itself in these pixel lines of fixed
 size and wrong color.

 I have written a testcase as part of libdrm for this issue:
 https://github.com/tobiasjakobi/libdrm/commit/db8bf6844436598251f67a71fc334b929bfb2b71

 It allocates N (N an even number) buffers which are aligned to the
 system pagesize. Then it does this each iteration:
 1) Fill the first N/2 buffers with random data
 2) Copy the first half to the second half of the buffers
 3) memcmp() first and second half (verification pass)

 Usually this verification already fails on the first iteration. An
 interesting observation is that increasing (!) the buffer size (so the
 amount of pixels that have to copied per buffer grows) makes this issue
 less likely to happen.

 With the default 512x512 buffers however it happens, like I said above,
 almost immediately.

 This is obviously a cache flush missing. The memory you get from
 userspace is normal cached memory, so to make it visible to the GPU you
 need to flush parts of the cache out to main memory.
 
 The corruption you are seeing is just unflushed cachelines. This also
 explains why increasing the buffer size helps: the more memory the CPU
 touches the more cachelines will be flushed out to be replaced with new
 data.
I should point out that the snapshots I uploaded were done with a
different setup. There only the source memory of the G2D operation is a
userspace allocated buffer. The destination is a GEM buffer allocated
through libdrm, which is then used as framebuffer. So the issue already
appears when just the source is userspace allocated.

What works however is an operation between GEM to GEM. However this
might be related to the default allocation flags libdrm uses.



 So you need to go and have a look at dma_map() and dma_sync_*_for_*()
 and friends.
 
 Regards,
 Lucas
 


With best wishes,
Tobias



--
--
___
Dri-devel mailing list
Dri-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dri-devel


drm/exynos: g2d userptr memory corruption

2015-08-16 Thread Tobias Jakobi
Hello,

some time ago I checked whether I could use the userptr functionality to
do zero-copy from userspace allocated buffers via the G2D. This didn't
work out so well, so kinda put this to the bottom of my TODO list.

Now that IOMMU support has landed and Jan Kara has rewrote page pinning
using frame vectors (see [1]) I gave userptr another try.

The results are much better. I'm not experiencing any kernel lockups or
sysmmu pagefaults anymore. However the image now suffers from visual
artifacts. These images show the nature of the artifacts:
http://i.imgur.com/nzT6g3Y.jpg
http://i.imgur.com/wkuYI6X.jpg

The corruption always manifests itself in these pixel lines of fixed
size and wrong color.

I have written a testcase as part of libdrm for this issue:
https://github.com/tobiasjakobi/libdrm/commit/db8bf6844436598251f67a71fc334b929bfb2b71

It allocates N (N an even number) buffers which are aligned to the
system pagesize. Then it does this each iteration:
1) Fill the first N/2 buffers with random data
2) Copy the first half to the second half of the buffers
3) memcmp() first and second half (verification pass)

Usually this verification already fails on the first iteration. An
interesting observation is that increasing (!) the buffer size (so the
amount of pixels that have to copied per buffer grows) makes this issue
less likely to happen.

With the default 512x512 buffers however it happens, like I said above,
almost immediately.

I first suspected that the clock rate of the G2D was too high (I
overclock the engine from 200MHz to 400MHz here), but even with the
default clock there is no change to the behaviour.

While looking at the issue I remember this discussion [2] so while ago.

Adding Marek to Cc since I guess that this could be related to the IOMMU
as well (some missing flushing?).


With best wishes,
Tobias


[1] http://www.spinics.net/lists/linux-samsung-soc/msg45931.html
[2] http://lists.freedesktop.org/archives/dri-devel/2014-July/062675.html

--
--
___
Dri-devel mailing list
Dri-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dri-devel