Re: [HACKERS] advance local xmin more aggressively

Heikki Linnakangas Wed, 10 Dec 2014 09:59:24 -0800

On 12/10/2014 06:56 PM, Robert Haas wrote:

On Wed, Dec 10, 2014 at 9:49 AM, Robert Haas <[email protected]> wrote:

I guess this bears some further thought.  I certainly don't like the
fact that this makes the whole system crap out at a lower number of
subtransactions than presently.  The actual performance numbers don't
bother me very much; I'm comfortable with the possibility that closing
a cursor will be some modest percentage slower if you've got thousands
of active savepoints.


Here's a new version with two changes:

1. I changed the traversal of the resource owner tree to iterate
instead of recurse.  It now does a depth-first, pre-order traversal of
the tree; when we reach the last child of a node, we follow its parent
pointer to get back to where we were.  That way, we don't need to keep
anything on the stack.  That fixed the crash at 100k cursors, but it
was still 4x slower.

Clever. Could we use that method in ResourceOwnerReleaseInternal andResourceOwnerDelete, too? Might be best to have aResourceOwnerWalk(resowner, callback) function for walking all resourceowners in a tree, instead of one for walking the snapshots in them.

2. Instead of traversing the tree until we find an xmin equal to the
one we're currently advertising, the code now traverses the entire
tree each time it runs. However, it also keeps a record of how many
times the oldest xmin occurred in the tree, which is decremented each
time we unregister a snapshot with that xmin; the traversal doesn't
run again until that count reaches 0.  That fixed the performance
regression on your test case.

With a million subtransactions:

master 34.464s 33.742s 34.317s
advance-xmin 34.516s 34.069s 34.196s

Well, you can still get the slowness back by running other stuff in thebackground. I admit that it's a very obscure case, probably fine inpractice. I would still feel better if snapmgr.c did its ownbookkeeping, from an aesthetic point of view. In a heap, or even just alinked list, if the performance characteristics of that is acceptable.

It occurs to me that the pairing heap I just posted in another thread(http://www.postgresql.org/message-id/[email protected]) wouldbe a good fit for this. It's extremely cheap to insert to and to findthe minimum item (O(1)), and the delete operation is O(log N),amortized. I didn't implement a delete operation, for removing aparticular node, I only did delete-min, but it's basically the samecode. Using a pairing heap for this might be overkill, but if we havethat implementation anyway, the code in snapmgr.c to use it would bevery simple, so I see little reason not to. It might even be simplerthan your patch, because you wouldn't need to have the heuristics onwhether to attempt updating the xmin; it would be cheap enough to alwaystry it.


- Heikki



--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] advance local xmin more aggressively

Reply via email to