Re: [Mesa-dev] Representing explicit memory layouts in NIR

Timothy Arceri Fri, 30 Nov 2018 14:40:15 -0800

On 1/12/18 9:11 am, Jason Ekstrand wrote:

All,
This week, I've been working on trying to move UBO and SSBO access in NIR over to deref instructions. I'm hoping that this will allow us to start doing alias analysis and copy-propagation on it. The passes we have in NIR *should* be able to work with SSBOs as long as nir_compare_derefs does the right thing.
# A story about derefs
In that effort, I've run into a bit of a snag with how to represent the layout information. What we get in from SPIR-V for Vulkan is a byte offset for every struct member and a byte stride for every array (and pointer in the OpPtrAccessChain case). For matrices, there is an additional RowMajor boolean we need to track somewhere. With OpenCL memory access, you don't get these decorations but it's trivial to translate the OpenCL layout (it's the same as C) to offset/stride when creating the type. I've come up with three different ways to represent the information and they all have their own downsides:
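As a rough illustration of the "same as C" translation mentioned above, here is a minimal sketch of computing per-member byte offsets and the resulting struct size (which doubles as the array stride for an array of that struct). The names `member_desc`, `align_up`, and `c_layout_offsets` are made up for this example and are not Mesa functions; it also assumes power-of-two alignments.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical description of one struct member: size and alignment in bytes. */
struct member_desc {
   size_t size;
   size_t align;
};

/* Round value up to the next multiple of align (align must be a power of two). */
static size_t
align_up(size_t value, size_t align)
{
   assert(align != 0 && (align & (align - 1)) == 0);
   return (value + align - 1) & ~(align - 1);
}

/* Fill offsets[] with C-style byte offsets for each member and return the
 * padded struct size, i.e. the byte stride of an array of this struct.
 */
static size_t
c_layout_offsets(const struct member_desc *members, size_t count,
                 size_t *offsets)
{
   size_t offset = 0;
   size_t max_align = 1;

   for (size_t i = 0; i < count; i++) {
      offset = align_up(offset, members[i].align);
      offsets[i] = offset;
      offset += members[i].size;
      if (members[i].align > max_align)
         max_align = members[i].align;
   }

   /* Tail padding so consecutive array elements stay aligned. */
   return align_up(offset, max_align);
}
```

The output of something like this is exactly the offset/stride pair that Vulkan SPIR-V supplies as explicit decorations.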
## 1. Put the information on the glsl_type similar to how it's done in SPIR-V
This has the advantage of being fairly non-invasive to glsl_type. A lot of the fields we need are already there and the only real change is to allow array types to have strides. The downside is that the information is often not where you want. Arrays and structs are ok but, for matrices, you have to go fishing all the way back to the struct type to get the RowMajor and MatrixStride decorations. (Thanks, SPIR-V...) While this seems like a local annoyance, it actually destroys basically all the advantages of having the information on the type and makes lower_io a real pain.
## 2. Put the information on the type but do it properly
In this version, we would put the matrix stride and RowMajor decoration directly on the matrix type. One obvious advantage here is that it means no fishing for matrix type information. Another is that, by having the types specialized like this, the only way to change layouts mid-deref-chain would be to have a cast. Option 1 doesn't provide this because matrix types are the same regardless of whether or not they're declared RowMajor in the struct. The downside to this option is that it requires glsl_type surgery to make it work. More on that in a bit.
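To make the "no fishing" point concrete, here is a small sketch of what lowering could look like if the matrix type itself carried the stride and RowMajor flag. `toy_matrix_type` and `toy_matrix_elem_offset` are hypothetical names for illustration only; this is not the real glsl_type, and it assumes the SPIR-V meaning of MatrixStride (bytes between columns for column-major, between rows for row-major).

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical layout-carrying matrix type; not the real glsl_type. */
struct toy_matrix_type {
   uint32_t columns;
   uint32_t rows;
   uint32_t elem_size;     /* bytes per scalar component */
   uint32_t matrix_stride; /* bytes between columns, or rows if row_major */
   bool row_major;
};

/* Byte offset of component (col, row) using only what is on the type. */
static uint32_t
toy_matrix_elem_offset(const struct toy_matrix_type *m,
                       uint32_t col, uint32_t row)
{
   assert(col < m->columns && row < m->rows);

   if (m->row_major)
      return row * m->matrix_stride + col * m->elem_size;

   return col * m->matrix_stride + row * m->elem_size;
}
```

With a float mat4 and a matrix stride of 16, the row-major case places consecutive components of one column 16 bytes apart, which is why the column vectors themselves would need a stride.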
## 3. Put the information directly on the deref
Instead of putting the stride/offset information on the type, we just put it on the deref as we build the deref chain. This is easy enough to do in spirv_to_nir and someone could also do it easily enough as a lowering pass based on a type_size function. This has the advantage of simplicity because you don't have to modify glsl_type at all and lowering is stupid-easy because all the information you need is right there on the deref. The downside, however, is that your alias analysis is potentially harder because you don't have the nice guarantee that you don't see a layout change without a type cast. The other downside is that we can't ever use copy_deref with anything bigger than a vector because you don't know the sizes of any types and, unless spirv_to_nir puts the offset/stride information on the deref, there's no way to reconstruct it.
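A minimal sketch of why lowering becomes easy under option 3: if each link in the chain carries its own stride or offset, the final byte offset is a simple walk back to the variable. The `toy_deref` struct below is a hypothetical stand-in, not nir_deref_instr, and it only handles constant array indices.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

enum toy_deref_kind {
   TOY_DEREF_VAR,
   TOY_DEREF_ARRAY,
   TOY_DEREF_STRUCT,
};

/* Hypothetical deref link with the layout stored directly on it. */
struct toy_deref {
   enum toy_deref_kind kind;
   const struct toy_deref *parent; /* NULL for TOY_DEREF_VAR */
   uint32_t index;                 /* constant array index (ARRAY only) */
   uint32_t stride;                /* byte stride (ARRAY only) */
   uint32_t offset;                /* member byte offset (STRUCT only) */
};

/* Sum the contribution of every link from the leaf back to the variable. */
static uint32_t
toy_deref_byte_offset(const struct toy_deref *d)
{
   uint32_t total = 0;

   for (; d != NULL; d = d->parent) {
      switch (d->kind) {
      case TOY_DEREF_VAR:
         assert(d->parent == NULL);
         break;
      case TOY_DEREF_ARRAY:
         total += d->index * d->stride;
         break;
      case TOY_DEREF_STRUCT:
         total += d->offset;
         break;
      }
   }

   return total;
}
```

Note that nothing here consults a type, which is both the appeal and the problem: two chains over the same type can carry different strides with no cast in between.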
I've prototyped both 1 and 3 so far and I definitely like 3 better than 1 but it's not great. I haven't prototyped 2 yet due to the issue mentioned with glsl_type.
Between 2 and 3, I really don't know how much we actually lose in terms of our ability to do alias analysis. I've written the alias analysis for 3 and it isn't too bad. I'm also not sure how much we would actually lose from not being able to express whole-array or whole-struct copies. However, without a good reason otherwise, option 2 really seems like it's the best of all worlds....
# glsl_type surgery

You want a good reason, eh?  You should have known this was coming...
The problem with option 2 above is that it requires significant glsl_type surgery to do it. Putting decorations on matrices violates one of the core principles of glsl_type, namely that all fundamental types (scalars, vectors, matrices, images, and samplers) are singletons. Other types such as structs and arrays we build on-the-fly and cache as-needed. In order to do what we need for option 2 above, you have to at least drop this for matrices and possibly vectors (the columns of a row-major mat4 are vectors with a stride of 16). Again, I see two options:
## A. Major rework of the guts of glsl_type
Basically, get rid of the static singletons and just use the build on-the-fly and cache model for everything. This would mean that mat4 == mat4 is no longer guaranteed unless you know a priori that none of your types are decorated with layout information. It would also be, not only a pile of work, but a single mega-patch. I don't know of any way to make that change without just ripping it all up and putting it back together.

Do we really need to throw away the singleton model? Could we not add another type on top of matrices to hold the layout information much like how we handle arrays and structs and just strip it off (like we often do with arrays) when needed for comparisons?


It's possible this could be messy, just trying to throw some ideas out there.
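Roughly what I have in mind, as a toy sketch only: the matrix stays a singleton and a separate wrapper type carries the layout, which gets stripped before pointer comparison. All names here (`toy_type`, `toy_mat4`, `toy_without_layout`, `toy_types_equal_ignoring_layout`) are hypothetical and nothing like this exists in glsl_type today.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

enum toy_kind {
   TOY_MAT4,
   TOY_LAYOUT_WRAPPER,
};

/* Hypothetical type node: either a bare matrix or a layout wrapper. */
struct toy_type {
   enum toy_kind kind;
   const struct toy_type *wrapped; /* WRAPPER only, NULL otherwise */
   uint32_t matrix_stride;         /* WRAPPER only */
   bool row_major;                 /* WRAPPER only */
};

/* The undecorated matrix remains a single static instance. */
static const struct toy_type toy_mat4 = { TOY_MAT4, NULL, 0, false };

/* Peel off any layout wrappers to reach the singleton underneath. */
static const struct toy_type *
toy_without_layout(const struct toy_type *t)
{
   while (t->kind == TOY_LAYOUT_WRAPPER) {
      assert(t->wrapped != NULL);
      t = t->wrapped;
   }
   return t;
}

/* Pointer equality still works once the wrappers are stripped. */
static bool
toy_types_equal_ignoring_layout(const struct toy_type *a,
                                const struct toy_type *b)
{
   return toy_without_layout(a) == toy_without_layout(b);
}
```

Code that cares about layout would compare the wrappers; everything else would strip first and keep relying on mat4 == mat4.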

## B. Make a new nir_type and make NIR use it
This seems a bit crazy at this point. src/compiler/nir itself has over 200 references to glsl_type and that doesn't include back-ends. It'd be a major overhaul and it's not clear that it's worth it. However, it would mean that we'd have a chance to rewrite types and maybe do it better. Basing it on nir_alu_type instead of glsl_base_type would be really nice because nir_alu_type already has an orthogonal split between bit size and format (float, uint, etc.). I would also likely structure it like vtn_type which has a different base_type concept which I find works better than glsl_base_type.
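For anyone unfamiliar with what an orthogonal split between bit size and format buys you, here is a toy sketch of the idea. The constants and helpers below (`TOY_BASE_*`, `toy_make_type`, `toy_type_bit_size`, `toy_type_base`) are invented for illustration and do not use the actual nir_alu_type values.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical encoding: low byte is the bit size, next byte the format. */
#define TOY_SIZE_MASK 0x00ffu
#define TOY_BASE_MASK 0xff00u

enum toy_base {
   TOY_BASE_INT   = 1 << 8,
   TOY_BASE_UINT  = 2 << 8,
   TOY_BASE_FLOAT = 3 << 8,
};

/* Combine a format and a bit size into one value. */
static uint32_t
toy_make_type(uint32_t base, uint32_t bit_size)
{
   assert((base & ~TOY_BASE_MASK) == 0);
   assert(bit_size == 8 || bit_size == 16 || bit_size == 32 || bit_size == 64);
   return base | bit_size;
}

/* Extract just the bit size. */
static uint32_t
toy_type_bit_size(uint32_t type)
{
   return type & TOY_SIZE_MASK;
}

/* Extract just the format, independent of size. */
static uint32_t
toy_type_base(uint32_t type)
{
   return type & TOY_BASE_MASK;
}
```

Because the two fields never overlap, a pass can change the bit size without a table mapping every (format, size) pair to a distinct enum value, which is the awkward part of glsl_base_type.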
Of course, A would be less invasive than B but B would give us the chance to improve some things without rewriting quite as many levels of the compiler. There are a number of things I think we could do better but changing those in the GLSL compiler would be a *lot* of work especially since it doesn't use the C helpers that NIR does. On the other hand, the churn in NIR from introducing a new type data structure would be pretty big. I did a quick git grep and it looks like most of the back-ends make pretty light use of glsl_type when consuming NIR so maybe it wouldn't be that bad?
Thoughts?  Questions?  Objections?

--Jason

_______________________________________________
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/mesa-dev
