On Fri, May 13, 2016 at 3:37 AM, Bas Nieuwenhuizen
<b...@basnieuwenhuizen.nl> wrote:
> Using more than 1 wave per threadgroup does increase performance
> generally.  Not using too many patches per threadgroup also
> increases performance. Both catalyst and amdgpu-pro seem to
> use 40 patches as their maximum, but I haven't really seen
> any performance increase from limiting the number of patches
> to 40 instead of 64.

40 may be optimal for existing OpenGL apps on some chips.

Vulkan doesn't set more than 16.

Let's set either 40 or 16 with a comment where the value comes from.

>
> Note that the trick where we overlap the input and output LDS
> does not work anymore as the insertion of the tess factors
> changes the patch stride.

I don't understand this. Can you explain it more?

Marek
_______________________________________________
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/mesa-dev

Reply via email to