Re: [HACKERS] Hot standby, recovery infra

Heikki Linnakangas Thu, 05 Feb 2009 01:47:42 -0800

Simon Riggs wrote:

On Thu, 2009-02-05 at 10:31 +0200, Heikki Linnakangas wrote:
Simon Riggs wrote:
On Thu, 2009-02-05 at 09:28 +0200, Heikki Linnakangas wrote:
I got rid of minSafeStartPoint, advancing minRecoveryPoint instead. Andit's advanced in XLogFlush instead of XLogFileRead. I'll post an updatedpatch soon.
Why do you think XLogFlush is called less frequently than XLogFileRead?
It's not, but we only need to update the control file when we're"flushing" an LSN that's greater than current minRecoveryPoint. And whenwe do update minRecoveryPoint, we can update it to the LSN of the lastrecord we've read from the archive.
So we might end up flushing more often *and* we will be doing it
potentially in the code path of other users.

For example, imagine a database that fits completely in shared buffers.If we update at every XLogFileRead, we have to fsync every 16MB of WAL.If we update in XLogFlush the way I described, you only need to updatewhen we flush a page from the buffer cache, which will only happen atrestartpoints. That's far less updates.

Expanding that example to a database that doesn't fit in cache, you'restill replacing pages from the buffer cache that have been untouched forlongest. Such pages will have an old LSN, too, so we shouldn't need toupdate very often.

I'm sure you can come up with an example of where we end up fsyncingmore often, but it doesn't seem like the common case to me.

This change seems speculative and also against what has previously been
agreed with Tom. If he chooses not to comment on your changes, that's up
to him, but I don't think you should remove things quietly that have
been put there through the community process, as if they caused

problems. I feel like I'm in the middle here.

I'd like to have the extra protection that this approach gives. If welet safeStartPoint to be ahead of the actual WAL we've replayed, we haveto just assume we're fine if we reach end of WAL before reaching thatpoint. That assumption falls down if e.g recovery is stopped, and you goand remove the last few WAL segments from the archive before restartingit, or signal pg_standby to trigger failover too early. Tracking thereal safe starting point and enforcing it always protects you from that.

(we did discuss this a week ago:http://archives.postgresql.org/message-id/[email protected])


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Hot standby, recovery infra

Reply via email to