Another question I had about __constant, was there seems to be no limit. I'm using __constant for every read-only parameter now totalling 1500Kb and this test now runs in 32ms. So, is there a limit? Is this method reliable? Can driver do this implicitly on all read-only buffers? thanks
On Tue Nov 25 2014 at 2:11:26 PM Tony Moore <[email protected]> wrote: > Hello, > I notice that reads are not being combined when I use __constant on a > read-only kernel buffer. Is this something that can be improved? > > In my kernel there are many loads from a read-only data structure. When I > use the __global specifier for the memory space I see a total of 33 send > instructions and a runtime of 81ms. When I use the __constant specifier, I > see 43 send instructions and a runtime of 40ms. I'm hoping that combining > the loads could improve performance further. > > thanks! > tony >
_______________________________________________ Beignet mailing list [email protected] http://lists.freedesktop.org/mailman/listinfo/beignet
