[HACKERS] Logical replication & oldest XID.

Konstantin Knizhnik Tue, 31 May 2016 07:53:50 -0700

Hi,

We are using logical replication in multimaster and are faced with someinteresting problem with "frozen" procArray->replication_slot_xmin.This variable is adjusted by ProcArraySetReplicationSlotXmin which isinvoked by ReplicationSlotsComputeRequiredXmin, whichis in turn is called by LogicalConfirmReceivedLocation. If transactionsare executed at all nodes of multimaster, then everything works fine:replication_slot_xmin is advanced. But if we send transactions only toone multimaster node and broadcast this changes to other nodes, then nodata is send through replications slot at this nodes. No data sends - noconfirmations, LogicalConfirmReceivedLocation is not called andprocArray->replication_slot_xmin preserves original value 599.

As a result GetOldestXmin function always returns 599, so autovacuum isactually blocked and our multimaster is not able to perform cleanup ofXID->CSN map, which cause shared memory overflow. This situation happensonly when write transactions are sent only to one node or if there areno write transactions at all.

Before implementing some workaround (for example forces all ofReplicationSlotsComputeRequiredXmin), I want to understand if it is realproblem of logical replication or we are doing something wrong? BDRshould be faced with the same problem if all updates are performed fromone node...


--
Konstantin Knizhnik
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company



--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

[HACKERS] Logical replication & oldest XID.

Reply via email to