Hello,

Here's a respin of my SIMD8 VS input refactors, new GS input refactors,
and finally the long-awaited SIMD8 GS support.

Patches 1-10 are input refactoring.  Patches 11-23 are SIMD8 GS support.

To try out SIMD8 GS, set the INTEL_SCALAR_GS=1 environment variable.
Note that the feature requires Gen8+, and I've only tested Broadwell.
Piglit passes with no regressions; I haven't had a chance to try dEQP
yet.  Synmark works.  However, Orbital Explorer causes the GPU to hang.
I'm not sure why yet; the shader is huge.  I'll have to look into it.

Future work:
- vec4 VS input refactors (I have a patch, but there are a few bugs left)
- vec4 GS input refactors (I think we can simplify more, I just did the
  bare minimum to keep it working across the nir_lower_io rework, and
  proved that it worked the way I wanted in my new backend...)
- SIMD8 GS optimizations?  Orbital Explorer is hurt badly - it goes from
  2977 -> 7304 instructions, with 380 spills/1473 fills.
- Tune the push/pull split point (I arbitrarily chose 32 registers).

--Ken

_______________________________________________
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev

Reply via email to