Paul Brook wrote:
Lines =  2863
Other  50 =    1.7%     (These are branch targets)
Branch 80 =    2.8%
Scalar 1008 = 35.2%
Vector 1725 = 60.3%

The only argument I've heard so far in favour of MIMD is that it
would improve the performance for scalar workloads. For a graphics
chip, that doesn't appear to be a smart approach.

By your own numbers conventional 4-element SIMD gets <70% ALU utilization.
not necessarily because it's a static analysis.

percentage of instructions does not mean utilisation.
For example a control instruction is not used as often
as others, depending on the program structure.

To me it sounds like :
a GPU can use LIW instructions with 2 vector fields and one scalar/control 
field.
It would fit the static analysis numbers.

However a dynamic analysis could uncover other more interesting trends.

That leaves quite a lot of scope for improvement, be it full MIMD or several ALU threads controlled by a single dispatch unit (ala. Larabee SIMD with software thread combining).

If no improvement was possible, we wouldn't be speaking here ;-)
Now the problem is to limit the scope enough to get a decent,
simple, flexible model, keeping in mind that 100% ALU usage is not possible.
90% or 95% is good enough, however, but the most difficult is the first %s ...

regards,

Paul
yg

_______________________________________________
Open-graphics mailing list
[email protected]
http://lists.duskglow.com/mailman/listinfo/open-graphics
List service provided by Duskglow Consulting, LLC (www.duskglow.com)

Reply via email to