Re: [HACKERS] Inconsistent DB data in Streaming Replication

Shaun Thomas Wed, 10 Apr 2013 07:26:42 -0700

On 04/10/2013 09:10 AM, Tom Lane wrote:

IOW, I wouldn't consider skipping the rsync even if I had a feature
like this.

Totally. Out in the field, we consider the "old" database corrupt themoment we fail over. There is literally no way to verify the safety ofany data along the broken chain, given race conditions and multiplepotential failure points.

The only potential use case for this that I can see, would be for systemmaintenance and a controlled failover. I agree: that's a major PITA whendoing DR testing, but I personally don't think this is the way to fixthat particular edge case.

Maybe checksums will fix this in the long run... I don't know. DRBD hasa handy block-level verify function for things like this, and it canre-sync master/slave data by comparing the commit log across the serversif you tell it one node should be considered incorrect.

The thing is... we have clogs, and we have WAL. If we can assumebidirectional communication and verification (checksum comparison?) ofboth of those components, the database *should* be able to re-sync itself.

Even if that were possible given the internals, I can't see anyonejumping on this before 9.4 or 9.5 unless someone sponsors the feature.

Automatic re-sync would (within available WALs) be an awesome feature,though...


--
Shaun Thomas
OptionsHouse | 141 W. Jackson Blvd. | Suite 500 | Chicago IL, 60604
312-676-8870
stho...@optionshouse.com

______________________________________________

See http://www.peak6.com/email_disclaimer/ for terms and conditions related to 
this email


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Inconsistent DB data in Streaming Replication

Reply via email to