Hi Timothy,
I think I've got enough of a handle on the design to attempt a budget
for the dedicated multipliers. Assumptions:
- Four clocks to set up each span
- Interpolate 16 parameters (don't interpolate Z, yes I know it's
currently planned otherwise)
- Assume fog and LOD need 16-bit precision (they might not)
- Assume color is perspective-correct (some think it's OK to relax
this)
- 8-bit precision is good enough for all blending
- Still need to double-check for any overlooked multiplies
So:
Interpolation correction:   4  (16 interpolants / 4 clocks)
Perspective reciprocal:     2  (1 reciprocal x 2 pixels)
Perspective correction:    30  (15 parameters x 2 pixels)
Not allocated:              4
                         ----
Total:                     40  dedicated 18x18 multipliers
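As a quick sanity check on the arithmetic, here is a minimal Python
sketch of the budget. The function and key names are made up for
illustration and are not part of the design; the numbers are just the
assumptions listed above.

```python
import math

INTERPOLANTS = 16       # parameters interpolated along a span
SETUP_CLOCKS = 4        # clocks available to set up each span
PIXELS_PER_CLOCK = 2    # target fill rate
CORRECTED_PARAMS = 15   # from the "15 parameters x 2 pixels" line


def multiplier_budget(spare=4):
    """Return the 18x18 multiplier count per stage (hypothetical names)."""
    budget = {
        # 16 interpolants shared over 4 setup clocks -> 4 multipliers
        "interpolation_correction": math.ceil(INTERPOLANTS / SETUP_CLOCKS),
        # one reciprocal per pixel, two pixels per clock
        "perspective_reciprocal": 1 * PIXELS_PER_CLOCK,
        # one multiply per corrected parameter per pixel
        "perspective_correction": CORRECTED_PARAMS * PIXELS_PER_CLOCK,
        # held back in case a multiply was overlooked
        "not_allocated": spare,
    }
    budget["total"] = sum(budget.values())
    return budget


if __name__ == "__main__":
    for stage, count in multiplier_budget().items():
        print(f"{stage:26s}{count:4d}")
```

Changing SETUP_CLOCKS or CORRECTED_PARAMS shows how quickly the total
moves if one of the assumptions turns out to be wrong.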
This will deliver 2 pixels/clock most of the time, but will stall for up
to 3 clocks on spans too short to cover the 4-clock setup. Worst case is
filling the screen with single-pixel vertical strips, which will cut the
fill rate by 75%. Typical throughput reduction should be much less,
perhaps in the 5-10% range.
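The stall figures can be sketched with a rough model. This assumes
(my assumption, not something settled in the design) that the setup for
one span overlaps with drawing another, so a span only stalls for
whatever part of the 4 setup clocks its own drawing does not cover. The
function names are hypothetical.

```python
import math


def span_clocks(pixels, setup_clocks=4, pixels_per_clock=2):
    """Return (draw_clocks, stall_clocks) for one span.

    Assumes span setup is hidden behind drawing, so the stall is
    whatever part of the setup time the drawing does not cover.
    """
    draw = math.ceil(pixels / pixels_per_clock)
    stall = max(0, setup_clocks - draw)
    return draw, stall


def clock_utilization(span_lengths, setup_clocks=4, pixels_per_clock=2):
    """Fraction of clocks spent drawing rather than stalled."""
    draw_total = 0
    stall_total = 0
    for pixels in span_lengths:
        draw, stall = span_clocks(pixels, setup_clocks, pixels_per_clock)
        draw_total += draw
        stall_total += stall
    total = draw_total + stall_total
    return draw_total / total if total else 1.0


if __name__ == "__main__":
    print(span_clocks(1))                  # single-pixel span
    print(clock_utilization([1] * 1000))   # screen of 1-pixel strips
    print(clock_utilization([8] * 1000))   # spans long enough to hide setup
```

Under this model a single-pixel span draws for 1 clock and stalls for 3,
so a screen of such spans keeps the pipeline busy on only a quarter of
the clocks, which is where the 75% worst case comes from. Spans of 7 or
more pixels do not stall at all. Passing setup_clocks=1 gives a feel for
what the four spare multipliers might buy.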
If there really are four multipliers left over, they could be applied
to cutting the span setup stall to a single clock.
Comments/flames?
Regards,
Daniel
_______________________________________________
Open-graphics mailing list