On Sun, Sep 15, 2013 at 8:13 AM, Kevin <k...@gatorgraphics.com> wrote:
> My attempts to compile it vectorized on OS X seemed to have failed since I 
> don't find a vector instruction in the .o file even though the options 
> -msse4.1 -funroll-loops -ftree-vectorize should be supported according to the 
> man page for Apple's llvm-gcc.

I'm not sure what version of LLVM Apple is using for llvm-gcc. I know
that clang+llvm 3.3 can successfully vectorize the checksum algorithm
when -O3 is used.

> So, has anyone compiled checksum vectorized on OS X? Are there any 
> performance data that would indicate whether or not I should worry with this 
> in the first place?

Even without vectorization the worst case performance hit is about
20%. This is for a workload that is fully bottlenecked on swapping
pages in between shared buffers and OS cache. In real world cases it's
hard to imagine it having any measurable effect. A single core can
checksum several gigabytes per second of I/O without vectorization,
and about 30GB/s with vectorization.

Regards,
Ants Aasma
-- 
Cybertec Schönig & Schönig GmbH
Gröhrmühlgasse 26
A-2700 Wiener Neustadt
Web: http://www.postgresql-support.de


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Reply via email to