On Mon, 2009-10-05 at 01:38 -0400, Alex Deucher wrote: 
> 2009/10/3 Michel Dänzer <[email protected]>:
> > From: Michel Dänzer <[email protected]>
> >
> > Also add fast paths for untransformed Composite operations.
> >
> > This can significantly reduce the CPU overhead in RadeonCompositeTileCP, at
> > least for TCL capable GPUs.
> > ---
> >
> > I think the basic idea is sound, but I'm not sure if some parts are going 
> > too
> > far, e.g. the float fw, fh locals in the fastpath. Opinions?
> 
> 
> Looks pretty good.  What sort of improvements are you seeing?

Not sure I've measured this one separately, but together with the
changes I pushed recently I've seen an x11perf -aa10text speedup on the
order of 10-20%, both with and without KMS.

> Are there any improvements to the non-tcl path? 

Hmm probably not as is, but it might be possible to use the fast path as
well at least in the untransformed case.

> If you wanted to take this a step further you could add some instructions
> take the reciprocal in the shader.

Right, but I wouldn't expect that to make any significant difference,
the setup overhead seems small compared to RadeonCompositeTileCP. Also
I'm not planning to mess with shaders in such a low-level form, but feel
free. :)

> Also, we don't yet take advantage of the tcl hw on r1xx and r2xx chips.

Yeah, that might be a worthwhile project for those with such hardware.


-- 
Earthling Michel Dänzer           |                http://www.vmware.com
Libre software enthusiast         |          Debian, X and DRI developer
_______________________________________________
xorg-driver-ati mailing list
[email protected]
http://lists.x.org/mailman/listinfo/xorg-driver-ati

Reply via email to