Re: [HACKERS] Let's invent a function to report lock-wait-blocking PIDs

Greg Smith Sun, 24 Mar 2013 07:26:39 -0700

On 3/20/13 2:02 PM, Tom Lane wrote:

If isolationtester were the only market for this type of information,
maybe it wouldn't be worth worrying about.  But I'm pretty sure that
there are a *lot* of monitoring applications out there that are trying
to extract who-blocks-whom information from pg_locks.  I hadn't realized
before quite how painful it is to do that, even incorrectly.

As a FYI, the one Marco wrote here is over 100 lines of code, and whilehe did a great job I'd still never suggest we release it--because it'smisleading in just enough cases to be dangerous. We can run itusefully, but I'd never hand this over to a customer and expect them todo something with it.

I propose that we should add a backend function that simplifies this
type of query.  The API that comes to mind is (name subject to
bikeshedding)

        pg_blocking_pids(pid int) returns int[]

I think there's a whole family of functions like this needed. This isone of them, so if it helps the isolation tester I'd be happy to see itadded as a first one, whether or not more come along one day.

I'd rather get the data back as a SRF because I'd usually be joining itto pg_locks and/or pg_stat_activity to figure out what the blocking pidsown or are doing. You can obviously convert the array form to/from theSRF form. The exposed function API that is easier for users to joinwith is my preference. If the isolation tester is easier to writeagainst the array form, it can play the appropriate nesting game to doso. I see that as the unusual case though, and it is also the one beingcoded by people who know how to handle the conversion.


The longer list of views/functions I keep wanting includes things like:

-What processes are blocking P from running?  [This new function]

-What processes hold locks and are running usefully--they have somelocks but all are granted? [Easy to extract from pg_locks]

-For each running process, which processes are waiting on them?[Requires a long WITH RECURSIVE query that doesn't get trapped bycircular locks]

-If I try to grab lock type L on object O, what existing locks will thatconflict with?

One really magic thing I'd like in this area is EXPLAIN (ANALYZE ON,LOCKS ON) which pops out a list of all the locks acquired when runningthat statement. We're never going to get fully correct documentation ofwhat locks a given statement needs. If I can figure that out in a testenvironment by running the statement there and seeing what locks itgrabbed along the way, that would eliminate most of the need fordocumenting things.

Note that an EXPLAIN based approach doesn't solve all the problems inthis area, because the trickiest ones I run into are ALTER TABLEchanges--which you can't EXPLAIN. Some API that dumps the locks anarbitrary statement acquired just before it exits would be ideal. Whena user can ask "what locks did an ALTER TABLE adding a foreign key takeand what order were they grabbed in?", that would solve the hardest ofthe questions I see in the field.


--
Greg Smith   2ndQuadrant US    g...@2ndquadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Let's invent a function to report lock-wait-blocking PIDs

Reply via email to