Re: [Pixman] Performance of radial gradients

Jeff Muizelaar Mon, 22 Feb 2010 11:33:43 -0800


On 22-Feb-10, at 10:35 AM, Rodrigo Kumpera wrote:

On Mon, Feb 22, 2010 at 1:39 PM, Siarhei Siamashka <[email protected]> wrote:
On Friday 19 February 2010, Luca Barbato wrote:
> On 02/19/2010 12:57 PM, Siarhei Siamashka wrote:
> > Adding small increments to the values at the end of loop iteration could
> > be the biggest source of precision loss. Replacing this with explicit
> > calculation like 'pdx = pdx0 + cx * n' should improve precision and maybe
> > allow to use floats freely. And floats work better with SIMD on any
> > platforms.
>
> And all the SIMD we are covering have a multiply-accumulate instruction
> that would be in use, I'm a bit more concerned about the sqrt usage
> though...
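To illustrate the precision point quoted above, here is a minimal sketch in C that compares the incremental update against the explicit 'pdx = pdx0 + cx * n' form. The function names and the double-precision reference are assumptions made for this illustration; this is not pixman code.

```c
#include <stddef.h>

/* Incremental form: add the step at the end of every loop iteration.
 * Each addition rounds, so the error grows with the iteration count. */
static float pdx_incremental(float pdx0, float cx, size_t n)
{
    float pdx = pdx0;
    for (size_t i = 0; i < n; i++)
        pdx += cx;
    return pdx;
}

/* Explicit form: recompute from the start value each time. One multiply
 * and one add, which is the shape a multiply-accumulate instruction takes. */
static float pdx_explicit(float pdx0, float cx, size_t n)
{
    return pdx0 + cx * (float)n;
}

/* Absolute error of a float result against a double-precision reference
 * (hypothetical helper, only used to compare the two forms). */
static double pdx_error(float value, float pdx0, float cx, size_t n)
{
    double exact = (double)pdx0 + (double)cx * (double)n;
    double diff = (double)value - exact;
    return diff < 0.0 ? -diff : diff;
}
```

Calling both forms with, say, pdx0 = 0, cx = 0.1f and n = 100000, then passing each result to pdx_error(), should show the explicit form staying much closer to the reference than the accumulated one.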

The usage of sqrt is probably not a fatal performance problem.
ARM11 VFP has a separate DS pipeline which can calculate divides or square
roots simultaneously with the other operations. So it's only a matter of
hiding very high square root calculation latency.
ARM Cortex-A8 has special SIMD instructions intended to help calculating
reciprocals and reciprocal square roots using Newton-Raphson method.

SSE has SIMD instructions for calculating square roots.
Note that SSE functions for square root and reciprocal are unfit for this
task as they have very little precision (12 bits). At least one
Newton-Raphson iteration must be done to have usable values.

rsqrt is an approximation, however the regular square root instructions
are not and are fine replacements for the sqrt() function. In fact, gcc on
OS X already does this substitution.
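
For reference, a minimal sketch in C of the Newton-Raphson refinement mentioned in the thread, applied to a coarse reciprocal square root estimate. The function name is hypothetical, and the estimate is passed in as a plain argument rather than produced by a hardware instruction, so this only shows the arithmetic of one step and not any particular SIMD API.

```c
/* One Newton-Raphson step for y ~= 1/sqrt(x):
 *     y1 = y0 * (1.5 - 0.5 * x * y0 * y0)
 * The number of correct bits roughly doubles per step, so an estimate
 * good to about 12 bits comes out close to full single precision. */
static float rsqrt_refine(float x, float y0)
{
    return y0 * (1.5f - 0.5f * x * y0 * y0);
}
```

For example, with x = 4 (exact answer 0.5) and an estimate that is off by one part in 4096, i.e. y0 = 0.5f * (1.0f + 1.0f / 4096.0f), a single call to rsqrt_refine() should bring the result to within about 1e-7 of 0.5.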


-Jeff

_______________________________________________
Pixman mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/pixman
