> Lines =  2863
> Other  50 =    1.7%   (These are branch targets)
> Branch 80 =    2.8%
> Scalar 1008 = 35.2%
> Vector 1725 = 60.3%
> 
> The only argument I've heard so far in favour of MIMD is that it
> would improve the performance for scalar workloads. For a graphics
> chip, that doesn't appear to be a smart approach.

By your own numbers conventional 4-element SIMD gets <70% ALU utilization. 
That leaves quite a lot of scope for improvement, be it full MIMD or several 
ALU threads controlled by a single dispatch unit (ala. Larabee SIMD with 
software thread combining).

Paul
_______________________________________________
Open-graphics mailing list
[email protected]
http://lists.duskglow.com/mailman/listinfo/open-graphics
List service provided by Duskglow Consulting, LLC (www.duskglow.com)

Reply via email to