On Fri, May 16, 2025 at 5:01 PM Amit Kapila <amit.kapil...@gmail.com> wrote:
>

Please find some more comments on the 0001 patch:
1.
We need to know about such transactions
+ * for conflict detection and resolution in logical replication. See
+ * GetOldestTransactionIdInCommit and its use.

Do we need to mention resolution in the above sentence? This patch is
all about detecting conflict reliably.

2. In wait_for_publisher_status(), we use remote_epoch,
remote_nextxid, and remote_oldestxid to compute full transaction id's.
Why can't we send FullTransactionIds for remote_oldestxid and
remote_nextxid from publisher? If these are required, maybe a comment
somewhere for that would be good.

3.
/*
+ * Note it is important to set committs value after marking ourselves as
+ * in the commit critical section (DELAY_CHKPT_IN_COMMIT). This is because
+ * we want to ensure all such transactions are finished before we allow
+ * the logical replication client to advance its xid which is used to hold
+ * back dead rows for conflict detection. See
+ * maybe_advance_nonremovable_xid.
+ */
+ committs = GetCurrentTimestamp();

How does setting committs after setting DELAY_CHKPT_IN_COMMIT help in
advancing client-side xid? IIUC, on client-side, we simply wait for
such an xid to be finished based on the remote_oldestxid and
remote_nextxid sent via the server. So, the above comment is not
completely clear to me. I have updated this and related comments in
the attached diff patch to make it clear. See if that makes sense to
you.

4.
In 0001's commit message, we have: "Furthermore, the preserved commit
timestamps and origin data are essential for
consistently detecting update_origin_differs conflicts." But it is not
clarified how this patch helps in consistently detecting
update_origin_differs conflict?

5. I have tried to add some more details in comments on why
oldest_nonremovable_xid needs to be FullTransactionId. See attached.

-- 
With Regards,
Amit Kapila.
diff --git a/src/backend/access/transam/twophase.c 
b/src/backend/access/transam/twophase.c
index 41fb4fc5025..ea508542745 100644
--- a/src/backend/access/transam/twophase.c
+++ b/src/backend/access/transam/twophase.c
@@ -2329,9 +2329,9 @@ RecordTransactionCommitPrepared(TransactionId xid,
        /*
         * Note it is important to set committs value after marking ourselves as
         * in the commit critical section (DELAY_CHKPT_IN_COMMIT). This is 
because
-        * we want to ensure all such transactions are finished before we allow
-        * the logical replication client to advance its xid which is used to 
hold
-        * back dead rows for conflict detection. See
+        * we want to ensure all transactions that have acquired commit 
timestamp
+        * are finished before we allow the logical replication client to 
advance
+        * its xid which is used to hold back dead rows for conflict detection. 
See
         * maybe_advance_nonremovable_xid.
         */
        committs = GetCurrentTimestamp();
diff --git a/src/backend/access/transam/xact.c 
b/src/backend/access/transam/xact.c
index dfc5108da3c..6e1dc76744f 100644
--- a/src/backend/access/transam/xact.c
+++ b/src/backend/access/transam/xact.c
@@ -1431,6 +1431,10 @@ RecordTransactionCommit(void)
                 * without holding the ProcArrayLock, since we're the only one
                 * modifying it.  This makes checkpoint's determination of 
which xacts
                 * are delaying the checkpoint a bit fuzzy, but it doesn't 
matter.
+                *
+                * Note, it is important to get the commit timestamp after 
marking the
+                * transaction in the commit critical section. See
+                * RecordTransactionCommitPrepared.
                 */
                Assert((MyProc->delayChkptFlags & DELAY_CHKPT_IN_COMMIT) == 0);
                START_CRIT_SECTION();
diff --git a/src/include/replication/worker_internal.h 
b/src/include/replication/worker_internal.h
index 0fdf49a1938..7d1e2096232 100644
--- a/src/include/replication/worker_internal.h
+++ b/src/include/replication/worker_internal.h
@@ -99,7 +99,9 @@ typedef struct LogicalRepWorker
         * conditions. Such scenarios might occur if the replication slot is not
         * yet created by the launcher while the apply worker has already
         * initialized this field. During this period, a transaction ID 
wraparound
-        * could falsely make this ID appear as if it originates from the 
future.
+        * could falsely make this ID appear as if it originates from the future
+        * w.r.t the transaction ID stored in the slot maintained by launcher. 
See
+        * advance_conflict_slot_xmin.
         */
        FullTransactionId oldest_nonremovable_xid;
 
diff --git a/src/include/storage/proc.h b/src/include/storage/proc.h
index 65ecf3280fb..c6f5ebceefd 100644
--- a/src/include/storage/proc.h
+++ b/src/include/storage/proc.h
@@ -133,10 +133,10 @@ extern PGDLLIMPORT int FastPathLockGroupsPerBackend;
  *
  * Setting DELAY_CHKPT_IN_COMMIT is similar to setting DELAY_CHKPT_START, but
  * it explicitly indicates that the reason for delaying the checkpoint is due
- * to a transaction being within a critical commit section. We need this to
- * ensure all such transactions are finished before we allow the logical
- * replication client to advance its xid which is used to hold back dead rows
- * for conflict detection.
+ * to a transaction being within a critical commit section. We need this new
+ * flag to ensure all the transactions that have acquired commit timestamp are
+ * finished before we allow the logical replication client to advance its xid
+ * which is used to hold back dead rows for conflict detection.
  */
 #define DELAY_CHKPT_START              (1<<0)
 #define DELAY_CHKPT_COMPLETE   (1<<1)

Reply via email to