Re: [HACKERS] [RFC][PATCH] wal decoding, attempt #2 - Design Documents (really attached)

Steve Singer Mon, 15 Oct 2012 17:27:15 -0700

On 12-10-15 04:51 PM, Andres Freund wrote:


Well, as a crosscheck, could you list your requirements?

Do you need anything more than outputting data in a format compatible to whats
stored in sl_log_*? You wouldn't have sl_actionseq, everything else should be
there (Well, you would need to do lookups to get the tableid, but thats not
really much of a problem). The results would be ordered in complete
transactions, in commit order.

I guess the other tables would stay as they are as they contain the "added
value" of slony?

Greetings,

I actually had spent some time a few weeks ago looking over thedocuments and code. I never did get around to writing a review aselegant as Peter's. I have not seen any red flags that make me thingthat what your proposing wouldn't be suitable for slony but sometimesyou don't see details until you start implementing something.

My initial approach to modifying slony to work with this might besomething like:

* Leave sl_event as is for non SYNC events, slon would still generateSYNC events in sl_event* We would modify the remote_worker thread in slon to instead ofselecting from sl_event it would get the the next 'committed'transaction from your apply cache. For each ApplyChange record wewould check to see if it is an insert into sl_event ,if so we wouldtrigger our existing event processing logic based on the contents of theev_type column.* If the change involves a insert/update/delete/truncate to a replicatedtable we would translate that change into SQL and apply it on thereplica, we would not commit changes on the replica until we encountera SYNC being added to sl_event for the current origin.* SQL will be applied in a slightly different order than slony doestoday. Today if two concurrent transactions are inserting into the samereplicated table and they commit one after the other there is a goodchance that the apply order on the replica will also be intermixed(assuming both commits were in between two SYNC events). My thinking isthat we would just replay them one after the other on the replica incommit order. (Slony doesn't use commit order because we don't have it,not because we don't like it) this would mean we do away with trackingthe action id.

* If a node is configured as a 'forwarder' not it would store theprocessed output of each ApplyChange record in a table on the replica.If a slon is pulling data from a non-orign (ie if remoteWorkerThread_1is pulling data from node 2) then it would need to query this tableinstead of calling the functions that process the ApplyCache contents.

* To subscribe a node we would generate a SYNC event on the provider anddo the copy_set. We would keep track of that SYNC event. The remoteworker would then ignore any data that comes before that SYNC eventwhen it starts pulling data from the apply cache.* DDL events in 2.2+ go into sl_ddl_script (or someting like that) whenwe see INSERT commands to that table we would now to then apply the DDLon the node.

* We would need to continue to populate sl_confirm because nowing whatSYNC events have already been processed by a node is pretty important ina MOVE SET or FAILOVER. It is possible that we might need to stilltrack the xip lists of each SYNC for MOVE SET/FAILOVER but I'm not surewhy/why not.


This is all easier said than implemented


Steve

Andres




--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] [RFC][PATCH] wal decoding, attempt #2 - Design Documents (really attached)

Reply via email to