[HACKERS] SSI performance

Heikki Linnakangas Fri, 04 Feb 2011 06:29:47 -0800

We know that the predicate locking introduced by the serializablesnapshot isolation patch adds a significant amount of overhead, whenit's used. It was fixed for sequential scans by acquiring a relationlevel lock upfront and skipping the locking after that, but the generalproblem for index scans and bitmap index scans remains.


I ran a little benchmark of that:


postgres=# begin isolation level repeatable read;
BEGIN
Time: 0,262 ms
postgres=#  SELECT COUNT(*) FROM foo WHERE id < 400000;
 count
--------
 399999
(1 row)

Time: 204,571 ms

postgres=# begin isolation level serializable;
BEGIN
Time: 0,387 ms
postgres=#  SELECT COUNT(*) FROM foo WHERE id < 400000;
 count
--------
 399999
(1 row)

Time: 352,293 ms

These numbers are fairly repeatable.

I ran oprofile to see where the time is spent, and fed the output tokcachegrind to get a better breakdown. Attached is a screenshot of that(I don't know how to get this information in a nice text format, sorry).As you might expect, about 1/3 of the CPU time is spent inPredicateLockTuple(), which matches with the 50% increase in executiontime compared to repeatable read. IOW, all the overhead comes fromPredicateLockTuple.

The interesting thing is that CoarserLockCovers() accounts for 20% ofthe overall CPU time, or 2/3 of the overhead. The logic ofPredicateLockAcquire is:


1. Check if we already have a lock on the tuple.
2. Check if we already have a lock on the page.
3. Check if we already have a lock on the relation.

So if you're accessing a lot of rows, so that your lock is promoted to arelation lock, you perform three hash table lookups on everyPredicateLockAcquire() call to notice that you already have the lock.

I was going to put a note at the beginning of this mail saying upfrontthat this is 9.2 materila, but it occurs to me that we could easily justreverse the order of those tests. That would cut the overhead of thecase where you already have a relation lock by 2/3, but make the casewhere you already have a tuple lock slower. Would that be a good tradeoff?


For 9.2, perhaps a tree would be better than a hash table for this..
--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

<<attachment: ssi-kcachegrind-screenshot.png>>

-- 
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

[HACKERS] SSI performance

Reply via email to