This new iterator uses the SSSE3 instructions pmaddubsw and pabsw to
implement a fast iterator for bilinear scaling.
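As a rough illustration of why pmaddubsw suits bilinear scaling, here is a scalar model of what one 16-bit lane of that instruction computes, followed by a hypothetical use that blends one colour channel of two neighbouring pixels. The function names, the 7-bit weight range and the final shift are assumptions made for this sketch and are not taken from the patch. The sketch says nothing about how pabsw is used.

```c
#include <stdint.h>

/* Scalar model of one 16-bit lane of pmaddubsw: two unsigned bytes are
 * multiplied by two signed bytes and the two products are added with
 * signed 16-bit saturation. */
static int16_t
pmaddubsw_lane (uint8_t a0, uint8_t a1, int8_t b0, int8_t b1)
{
    int32_t sum = (int32_t) a0 * b0 + (int32_t) a1 * b1;

    if (sum > INT16_MAX)
        sum = INT16_MAX;
    if (sum < INT16_MIN)
        sum = INT16_MIN;

    return (int16_t) sum;
}

/* Hypothetical use (an assumption, not the patch's code): blend one
 * channel of two neighbouring pixels with a 7-bit fraction `frac` in
 * [1, 127]. The weights sum to 128, so the accumulated value is at most
 * 255 * 128 = 32640 and never reaches the saturation limit. */
static uint8_t
lerp_channel (uint8_t left, uint8_t right, int frac)
{
    int16_t acc = pmaddubsw_lane (left, right,
                                  (int8_t) (128 - frac), (int8_t) frac);

    return (uint8_t) (acc >> 7);
}
```

Because the second operand is a signed byte, a weight of 128 does not fit, which is why the sketch restricts frac to [1, 127]; a real implementation would have to handle the frac = 0 case in some other way.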
There is a graph here recording the per-pixel time for various
bilinear scaling algorithms as reported by scaling-bench:
On Thu, Aug 29, 2013 at 01:02:51PM -0400, Søren Sandmann Pedersen wrote:
The following patches add a new SSSE3 implementation and an iterator
for separable bilinear scaling. As expected, the new iterator is
clearly faster than the C iterator.
Unfortunately or fortunately, when combined with
On Thu, 29 Aug 2013 15:18:57 -0400
Lennart Sorensen lsore...@csclub.uwaterloo.ca wrote:
I get crashes in the scaling and affinity tests on power7. The crashes
are always in the vmx code, so building with vmx support disabled makes
the problem go away.
The error is not consistent, so my
From: Søren Sandmann Pedersen s...@redhat.com
At the moment iter buffers are only guaranteed to be aligned to a 4
byte boundary. It is useful for SIMD implementations to be able to
assume that these buffers are aligned to 16 bytes, so ensure this.
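One common way to get such a guarantee is to over-allocate the buffer by 15 bytes and round the start pointer up to the next 16-byte boundary. The sketch below shows that rounding done through uintptr_t; the helper names align_up and aligned_scanline are made up for illustration and are not pixman's actual code.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: round a pointer up to the next multiple of
 * `alignment`, which must be a non-zero power of two. The arithmetic is
 * done in uintptr_t, which is wide enough to hold a pointer value. */
static void *
align_up (void *ptr, size_t alignment)
{
    uintptr_t p = (uintptr_t) ptr;
    uintptr_t mask = (uintptr_t) alignment - 1;

    assert (alignment != 0 && (alignment & mask) == 0);

    return (void *) ((p + mask) & ~mask);
}

/* Hypothetical use: obtain a 16-byte aligned scanline from a raw buffer
 * that the caller over-allocated by at least 15 bytes. */
static uint32_t *
aligned_scanline (uint8_t *raw)
{
    return (uint32_t *) align_up (raw, 16);
}
```

Using uintptr_t rather than unsigned long matters on platforms where unsigned long is narrower than a pointer, such as 64-bit Windows.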
V2:
- Use uintptr_t instead of unsigned long
-
On Thu, 29 Aug 2013 16:23:49 -0400
Lennart Sorensen lsore...@csclub.uwaterloo.ca wrote:
On Thu, Aug 29, 2013 at 10:56:49PM +0300, Siarhei Siamashka wrote:
On Thu, 29 Aug 2013 15:18:57 -0400
Lennart Sorensen lsore...@csclub.uwaterloo.ca wrote:
I get crashes in the scaling and affinity