On 12/12/2011 08:45 AM, Robert Haas wrote:
> But I'm skeptical that anything that we only update once per
> checkpoint cycle will help much in calculating an accurate lag value.

I'm sure there is no upper bound on how much WAL lag you can build up between commit/abort records either; they can be far less frequent than checkpoints. All it takes is a multi-hour COPY with no other commits to completely hose a lag number derived from those records, and that is not an unusual situation at all. Overnight ETL or materialized-view-style reporting roll-ups, scheduled specifically for when no one is normally at the office, are the first things that spring to mind.
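To make that concrete: the way people usually approximate lag from those records today is something like the query below, run on the standby (a minimal sketch; pg_last_xact_replay_timestamp() is the existing 9.1 function). During that multi-hour COPY the result climbs toward hours even when replay is fully caught up:

    -- Time since the last commit/abort record was replayed.  With no
    -- commits flowing, this measures master idle time, not real lag.
    SELECT now() - pg_last_xact_replay_timestamp() AS apparent_lag;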

Anyway, I wasn't suggesting checkpoints as anything other than a worst-case behavior. We can always push out more frequent updates to reduce the error, and in what I expect to be the most common case the WAL send/receive stuff will usually do much better. I see the XID vs. WAL position UI issues as being fundamentally unsolvable, and that really bothers me; if it didn't, I'd have run screaming away from this thread by now.

> It also strikes me that anything that is based on augmenting the
> walsender/walreceiver protocol leaves anyone who is using WAL
> shipping out in the cold.  I'm not clear from the comments you or
> Simon have made how important you think that use case still is.

There are a number of reasons why we might want more timestamps streamed into the WAL; this might be one. We'd just need one to pop out as part of the archive_timeout segment switch to, in theory, make it possible for these people to be happy. I think Simon was hoping to avoid WAL timestamps; I wouldn't bet too much on that myself. The obvious implementation problem here is that the logical place to put the timestamp is right at the end of the WAL file, just before it's closed for archiving. But that position isn't seen until you've at least started processing the file, which you are clearly not doing fast enough if lag exists.
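For reference, the existing knob those people lean on looks like this (values are illustrative examples only, not recommendations); whatever timestamp popped out at the switch would be at most archive_timeout stale:

    # postgresql.conf on the master -- illustrative values only
    archive_mode = on
    archive_command = 'cp %p /mnt/server/archivedir/%f'
    archive_timeout = 60    # force a segment switch at least once a minute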

As far as who's still important here, two observations. Note that the pg_last_xact_insert_timestamp approach can fail to satisfy WAL-shipping people whose archives go to a separate network, where it's impractical to connect to both servers with libpq. I have some customers who like putting a one-way WAL wall (sorry) between production and the standby server, with log shipping being the only route between them; that's one reason why they might still be doing this instead of using streaming. There's really no good way to make these people happy and provide time lag monitoring inside the database.
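To spell that out: computing a time lag with the function proposed in this thread takes one query per server, roughly as below (a sketch; pg_last_xact_insert_timestamp() is the proposed function, not something that exists today):

    -- On the master: timestamp of the last commit/abort record inserted.
    SELECT pg_last_xact_insert_timestamp();

    -- On the standby: timestamp of the last commit/abort record replayed.
    SELECT pg_last_xact_replay_timestamp();

    -- Lag is the difference between the two, so the monitoring tool needs
    -- a libpq connection to both sides -- exactly what the WAL wall forbids.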

I was actually the last person I recall suggesting extra monitoring aimed mainly at WAL shipping environments: http://archives.postgresql.org/pgsql-hackers/2010-01/msg01522.php  I had some pg_standby changes I was also working on back then, almost two years ago. I never circled back to any of it due to having zero demand once 9.0 shipped; the requests I had been regularly getting about this all dried up. While I'm all for keeping new features working for everyone when it doesn't hold progress back, it's not unreasonable to recognize that we can't support every monitoring option through all of the weird ways WAL files can move around. pg_stat_replication isn't very helpful for 9.0+ WAL shippers either, yet they still go on doing their thing.

In the other direction, for people who will immediately adopt the latest hotness, cascading replication is a whole new layer of use case concerns on top of the ones considered so far. Now you're talking two layers of connections users have to navigate through to compute master->cascaded standby lag. Cascade the walsender timestamps instead, which seems pretty simple to do, and then people can just ask their local standby.
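Today that would mean stitching together something like the following (a sketch, assuming the 9.2-era pg_stat_replication column names):

    -- On the master: only the intermediate standby is visible here.
    SELECT application_name, replay_location FROM pg_stat_replication;

    -- On the intermediate standby: only the cascaded standby is visible.
    SELECT application_name, replay_location FROM pg_stat_replication;

    -- Two connections, combined by hand.  Cascaded walsender timestamps
    -- would reduce this to one query against the local standby.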

--
Greg Smith   2ndQuadrant US    g...@2ndquadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support  www.2ndQuadrant.us

