On 4/17/13 6:32 PM, Tom Lane wrote:
> The more I read of this thread, the more unhappy I get.  It appears that
> the entire design process is being driven by micro-optimization for CPUs
> being built by Intel in 2013.

And that's not going to get anyone past review, since all the testing I've been doing for the last two weeks is about how fast an AMD Opteron 6234 with OS cache >> shared_buffers can run this. The main thing I'm still worried about is what happens on a fast machine that can move memory around very quickly, running an in-memory workload, but hamstrung by the checksum computation--and that machine isn't a 2013 Intel one.

The question I started with here was answered in some depth and then skipped past. I'd like to pull attention back to it, since I thought some good answers from Ants went by. Is there a simple way to optimize the committed CRC computation (or a similar one with the same error detection properties) based on either:

a) Knowing that the input will be an 8K page, rather than the existing use case of an arbitrarily sized WAL section. (A rough sketch of what that could enable follows this list.)

b) Straightforward code rearrangement or optimization flags.
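
To make (a) concrete, here is a minimal sketch, not the committed code, of a table-driven CRC where the block length is the compile-time constant BLCKSZ (8192 assumed here) instead of a run-time parameter. That alone lets the compiler unroll or pipeline the inner loop in ways it can't for arbitrarily sized WAL input; the reflected CRC-32C polynomial below is only an example choice:

#include <stdint.h>
#include <stddef.h>

#define BLCKSZ 8192                     /* assumed page size */

static uint32_t crc_table[256];

/* Build the byte-at-a-time lookup table for the reflected polynomial. */
static void
crc_init_table(void)
{
    uint32_t    i;
    int         k;

    for (i = 0; i < 256; i++)
    {
        uint32_t    c = i;

        for (k = 0; k < 8; k++)
            c = (c & 1) ? (c >> 1) ^ 0x82F63B78 : (c >> 1);
        crc_table[i] = c;
    }
}

/* CRC of exactly one page: the loop bound is a constant, not an argument. */
static uint32_t
crc_page(const unsigned char *page)
{
    uint32_t    crc = 0xFFFFFFFF;
    size_t      i;

    for (i = 0; i < BLCKSZ; i++)
        crc = crc_table[(crc ^ page[i]) & 0xFF] ^ (crc >> 8);

    return crc ^ 0xFFFFFFFF;
}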

That was all I thought was still feasible to consider changing for 9.3 a few weeks ago. And the possible scope has only been shrinking since then.

> And I reiterate that there is theory out there about the error detection
> capabilities of CRCs.  I'm not seeing any theory here, which leaves me
> with very little confidence that we know what we're doing.

Let me see if I can summarize where the messages flying by have ended up, since you'd like to close this topic for now:

-The original checksum feature used Fletcher checksums. Its main problems, to quote Wikipedia, include that it "cannot distinguish between blocks of all 0 bits and blocks of all 1 bits". (A small Fletcher sketch at the end of this summary demonstrates exactly that case.)

-The committed checksum feature uses a truncated CRC-32. That has known good error detection properties, but it is expensive to compute. There's reason to believe that particular computation will become cheaper on future platforms, but taking full advantage of that will require adding CPU-specific code to the database. (A sketch of what such CPU-specific code might look like also follows this summary.)

-The latest idea is using the Fowler–Noll–Vo hash function: https://en.wikipedia.org/wiki/Fowler_Noll_Vo_hash There's 20 years of research around when that is good or bad. The exact properties depend on magic "FNV primes": http://isthe.com/chongo/tech/comp/fnv/#fnv-prime that can vary based on both your target block size and how many bytes you'll process at a time. For PostgreSQL checksums, one of the common problems--getting an even distribution of the hashed values--isn't important the way it is for other types of hashes. Ants and Florian have now dug into exactly how that and specific CPU optimization concerns impact the best approach for 8K database pages. This is very clearly a 9.4 project that is just getting started. (A textbook FNV-1a reference sketch is also included below.)
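
To illustrate the Fletcher point with something runnable: this is textbook Fletcher-16 with the sums taken modulo 255, not the variant from the original patch. Since 0xFF is congruent to 0 modulo 255, a page of all one bits checksums exactly like a page of all zero bits:

#include <stdint.h>
#include <stddef.h>
#include <stdio.h>
#include <string.h>

static uint16_t
fletcher16(const unsigned char *data, size_t len)
{
    uint16_t    sum1 = 0;
    uint16_t    sum2 = 0;
    size_t      i;

    for (i = 0; i < len; i++)
    {
        sum1 = (sum1 + data[i]) % 255;
        sum2 = (sum2 + sum1) % 255;
    }
    return (uint16_t) ((sum2 << 8) | sum1);
}

int
main(void)
{
    unsigned char zeros[8192];
    unsigned char ones[8192];

    memset(zeros, 0x00, sizeof(zeros));
    memset(ones, 0xFF, sizeof(ones));

    /* Both lines print 0: the two degenerate pages are indistinguishable. */
    printf("all zeros: %u\n", fletcher16(zeros, sizeof(zeros)));
    printf("all ones:  %u\n", fletcher16(ones, sizeof(ones)));
    return 0;
}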
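
As for what "CPU-specific code" for the CRC case might look like, the sketch below is one possibility rather than a proposal: on x86 parts with SSE4.2, the crc32 instruction computes CRC-32C in hardware through the _mm_crc32_u64 intrinsic (compile with -msse4.2). Any build without it would need a portable fallback using the same polynomial so checksums match across platforms:

#include <stdint.h>
#include <stddef.h>
#include <string.h>

#define BLCKSZ 8192                     /* assumed page size */

#if defined(__SSE4_2__)
#include <nmmintrin.h>                  /* _mm_crc32_u64 */

/* CRC-32C of one 8K page, fed to the hardware eight bytes at a time. */
static uint32_t
crc32c_page_hw(const unsigned char *page)
{
    uint64_t    crc = 0xFFFFFFFF;
    size_t      i;

    for (i = 0; i < BLCKSZ; i += sizeof(uint64_t))
    {
        uint64_t    chunk;

        memcpy(&chunk, page + i, sizeof(chunk));
        crc = _mm_crc32_u64(crc, chunk);
    }
    return (uint32_t) crc ^ 0xFFFFFFFF;
}
#endif   /* else: fall back to a portable table-driven CRC-32C */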
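
And for reference on the FNV side, textbook 32-bit FNV-1a over a page looks like this. It uses the standard offset basis and prime, and it is deliberately the plain byte-at-a-time form rather than whatever block-size-tuned variant gets worked out for 9.4:

#include <stdint.h>
#include <stddef.h>

#define BLCKSZ 8192                     /* assumed page size */

#define FNV32_OFFSET_BASIS 2166136261u  /* standard 32-bit FNV offset basis */
#define FNV32_PRIME        16777619u    /* standard 32-bit FNV prime */

/* Reference FNV-1a: xor each byte in, then multiply by the FNV prime. */
static uint32_t
fnv1a_page(const unsigned char *page)
{
    uint32_t    hash = FNV32_OFFSET_BASIS;
    size_t      i;

    for (i = 0; i < BLCKSZ; i++)
    {
        hash ^= page[i];
        hash *= FNV32_PRIME;
    }
    return hash;
}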

--
Greg Smith   2ndQuadrant US    g...@2ndquadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com

