[Pixman] [PATCH 2/2] ssse3: Add iterator for separable bilinear scaling

2013-08-29 Thread Søren Sandmann Pedersen
This new iterator uses the SSSE3 instructions pmaddubsw and pabsw to implement a fast iterator for bilinear scaling. There is a graph here recording the per-pixel time for various bilinear scaling algorithms as reported by scaling-bench:

Re: [Pixman] [PATCH 0/2] SSSE3 iterator and fast path selection issues

2013-08-29 Thread Chris Wilson
On Thu, Aug 29, 2013 at 01:02:51PM -0400, Søren Sandmann Pedersen wrote: The following patches add a new SSSE3 implementation and an iterator for separable bilinear scaling. As expected, the new iterator is clearly faster than the C iterator. Unfortunately or fortunately, when combined with

Re: [Pixman] testsuite fails on power7

2013-08-29 Thread Siarhei Siamashka
On Thu, 29 Aug 2013 15:18:57 -0400 Lennart Sorensen lsore...@csclub.uwaterloo.ca wrote: I get crashes in the scaling and affinity tests on power7. The crashes are always in the vmx code, so building with vmx support disabled makes the problem go away. The error is not consistent, so my

[Pixman] [PATCH] general: Ensure that iter buffers are aligned to 16 bytes

2013-08-29 Thread Søren Sandmann
From: Søren Sandmann Pedersen s...@redhat.com

At the moment iter buffers are only guaranteed to be aligned to a 4 byte boundary. It is useful for SIMD implementations to be able to assume that these buffers are aligned to 16 bytes, so ensure this.

V2:
- Use uintptr_t instead of unsigned long
-
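
As a rough illustration of the idea in this patch description (rounding a buffer pointer up to a 16-byte boundary via `uintptr_t`), here is a minimal sketch. It is not the code from the patch: the names `ALIGN_BYTES`, `is_aligned_16` and `align_up_16` are hypothetical, and it assumes the caller has over-allocated the underlying storage by at least 15 bytes so that the rounded-up pointer still lies inside it.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical constant for illustration; not taken from pixman. */
#define ALIGN_BYTES 16

/* Returns nonzero if p sits on a 16-byte boundary. */
static int
is_aligned_16 (const void *p)
{
    return ((uintptr_t) p & (ALIGN_BYTES - 1)) == 0;
}

/* Rounds p up to the next 16-byte boundary (p itself if already aligned).
 * uintptr_t is used because it is wide enough to hold a pointer value,
 * whereas unsigned long is narrower than a pointer on some platforms
 * (for example 64-bit Windows).
 * Assumption: the caller reserved at least ALIGN_BYTES - 1 spare bytes. */
static void *
align_up_16 (void *p)
{
    uintptr_t addr = (uintptr_t) p;

    addr = (addr + (ALIGN_BYTES - 1)) & ~(uintptr_t) (ALIGN_BYTES - 1);

    assert (is_aligned_16 ((void *) addr));
    return (void *) addr;
}
```

A caller would typically declare something like `unsigned char raw[SIZE + 15]` and use `align_up_16 (raw)` as the start of the SIMD-visible buffer; the result advances the pointer by at most 15 bytes.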

Re: [Pixman] testsuite fails on power7

2013-08-29 Thread Siarhei Siamashka
On Thu, 29 Aug 2013 16:23:49 -0400 Lennart Sorensen lsore...@csclub.uwaterloo.ca wrote: On Thu, Aug 29, 2013 at 10:56:49PM +0300, Siarhei Siamashka wrote: On Thu, 29 Aug 2013 15:18:57 -0400 Lennart Sorensen lsore...@csclub.uwaterloo.ca wrote: I get crashes in the scaling and affinity