>
> Thanks for the comments, these are handled in the attached v20250516
> version patch.
>

Thanks for the patches. Here are my review comments -

Patch-0004: src/backend/replication/logical/sequencesync.c

The sequence count logic using curr_seq in copy_sequences() seems buggy.
Currently, curr_seq is incremented based on the number of tuples
received from the publisher inside the inner while loop.
This means it's counting the number of sequences returned by the
publisher, not the number of sequences processed locally. This can
lead to two issues:

1) Repeated syncing of sequences:
If some sequences are missing on the publisher, curr_seq will reflect
fewer items than expected, and subsequent batches may reprocess
already-synced sequences. Because next batch will use curr_seq to get
values from the list as -

  seqinfo = (LogicalRepSequenceInfo *)
lfirst(list_nth_cell(remotesequences, curr_seq + i));

Example:
For 110 sequences(s1 to s110), if 5 (s1 to s5) are missing on the
publisher in the first batch, curr_seq = 95. In the next cycle, we
resync s95 to s99.
~~~~

2) Risk of sequencesync worker getting stuck in infinite loop

Consider a case where remotesequences has 10 sequences (s1–s10) need
syncing, and concurrently s9, s10 are deleted on the publisher.

Cycle 1:
Publisher returns s1–s8. So curr_seq = 8.

Cycle 2:
Publisher query returns zero rows (as s9, s10 no longer exist).
curr_seq stays at 8 and never advances.

This causes the while (curr_seq < total_seq) loop to run forever.
~~~~

I think curr_seq  should be incremented by batch_seq_count just
outside the inner while loop.

--
Thanks,
Nisha


Reply via email to