> Lines = 2863 > Other 50 = 1.7% (These are branch targets) > Branch 80 = 2.8% > Scalar 1008 = 35.2% > Vector 1725 = 60.3% > > The only argument I've heard so far in favour of MIMD is that it > would improve the performance for scalar workloads. For a graphics > chip, that doesn't appear to be a smart approach.
By your own numbers conventional 4-element SIMD gets <70% ALU utilization. That leaves quite a lot of scope for improvement, be it full MIMD or several ALU threads controlled by a single dispatch unit (ala. Larabee SIMD with software thread combining). Paul _______________________________________________ Open-graphics mailing list [email protected] http://lists.duskglow.com/mailman/listinfo/open-graphics List service provided by Duskglow Consulting, LLC (www.duskglow.com)
