From: Karol Herbst
helps shaders in multiple games
total instructions in shared programs : 1925865 -> 1922112 (-0.19%)
total gprs used in shared programs: 251863 -> 251863 (0.00%)
total local used in shared programs : 5673 -> 5673 (0.00%)
total bytes used in shared
Please make this work for all chips. I don't want to have these
partial optimizations in place. The reason it was OK for nv50 is that
I thought this only ever applied to nv50, didn't realize that (a) we
didn't have FFMA32I not hooked up and (b) that it had this
restriction. Also you'd want to drop