Hello,
> You can monitor the pg_stat_activity for the SYNC_REP_WAIT_FLUSH wait
> types to detect this.
I tried to observe this, but I only see wait_event_type Client or IPC and
wait_event ClientRead or SyncRep. In which situation can I see the
SYNC_REP_WAIT_FLUSH value?
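For context, a query along these lines shows the blocked backends, and the
only wait_event I get there is 'SyncRep' (the column list is just
illustrative):

    -- backends waiting for the sync standby to confirm their commit
    SELECT pid, usename, state, wait_event_type, wait_event,
           now() - query_start AS waiting_for, query
    FROM pg_stat_activity
    WHERE wait_event_type = 'IPC'
      AND wait_event = 'SyncRep';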
> You should consider these as in doubt transactions and the client
> should retry. Again, this can happen in a normal server crash case too.
> For example, a transaction committed on the server and before sending
> the acknowledgement crashed. *The client should know how to handle
> these cases.*
I have only a light knowledge of in-doubt transactions and need to study
them more, but in the real world the client is mostly 'stupid' and expects
only COMMIT or ROLLBACK, nothing in between.
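To make such a blind retry at least safe, the write itself would have to be
idempotent, something like this (just a sketch; the table and the unique
key are made up):

    -- hypothetical table and key, only to show an idempotent retry:
    -- if the first, in-doubt attempt did commit, the retry becomes a
    -- no-op instead of a duplicate-key error
    INSERT INTO orders (order_id, customer_id, amount)
    VALUES ('ord-2021-0001', 42, 99.90)
    ON CONFLICT (order_id) DO NOTHING;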
> There is a third problem that I didn't talk about in this thread
> where the async clients (including logical decoding and replication
> clients) can get ahead of the new primary and there is no easier way to
> undo those changes. For this problem, we need to implement some protocol
> in the WAL sender where it sends the log to the consumer only up to the
> flush LSN of the standby/quorum replicas. This is something I am working
> on right now.
We set up an architecture with 4 nodes and Patroni as the cluster manager.
Two nodes are synchronous and each sync node has one async replica. If
something like this happens (e.g. the network to the sync replica fails
and the user presses CTRL+C), the async replica still receives the
transaction and applies it. If the outage lasts longer than a configured
time (30 s by default), the management software checks the LSNs and
creates a new sync replica from the async replica.
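In terms of plain PostgreSQL settings, the primary in this layout ends up
with roughly the following (node names are placeholders; in our case
Patroni manages this value itself):

    -- both standbys must confirm before a commit is acknowledged
    ALTER SYSTEM SET synchronous_standby_names = 'FIRST 2 (node2, node3)';
    SELECT pg_reload_conf();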
Ondrej
On 21/04/2021 09:20, SATYANARAYANA NARLAPURAM wrote:
>> This can be an option for us in our case. But there also needs to
>> be a process how to detect these "stuck commits" and how to
>> invalidate/remove them, because in reality, if the app/user would
>> not see the change in the database, it/he/she will try to
>> insert/delete it again. If it just stuck without management, it
>> will create a queue which can cause, that in the queue there will
>> be 2 similar inserts/deletes which can again cause issues (like
>> with the primary key I mentioned before).
> This shouldn't be a problem as the previous transaction is still
> holding the locks and the new transaction is blocked behind this.
> Outside of the sync replication, this can happen today too with
> glitches/timeouts/retries between the client and the server. Am I
> missing something?
>> So the process should be in this case:
>> - DBA receives information, that write operations stuck (DBA in
>> coordination with the infrastructure team disconnects all clients
>> and prevent new ones to create a new connection).
> You can monitor the pg_stat_activity for the SYNC_REP_WAIT_FLUSH wait
> types to detect this.
>> - DBA will recognize, that there is an issue in communication
>> between the primary and the sync replica (caused the issue with
>> the propagation of commits)
>> - DBA will see that there are some commits that are in the "stuck
>> state"
>> - DBA removes these stuck commits. Note: Because the client never
>> received a confirmation about the successful commit -> changes in
>> the DB client tried to perform can't be considered as successful.
> You should consider these as in doubt transactions and the client
> should retry. Again, this can happen in a normal server crash case
> too. For example, a transaction committed on the server and before
> sending the acknowledgement crashed. The client should know how to
> handle these cases.
>> - DBA and infrastructure team restore the communication between
>> server nodes to be able to propagate commits from the primary node
>> to sync replica.
>> - DBA and infrastructure team allows new connections to the database
>> This approach would require external monitoring and alerting, but
>> I would say, that this is an acceptable solution. Would your patch
>> be able to perform that?
> My patch handles ignoring the cancel events. I ended up keeping the
> other logic (blocking super user connections in the
> client_authentication_hook).
> There is a third problem that I didn't talk about in this thread where
> the async clients (including logical decoding and replication clients)
> can get ahead of the new primary and there is no easier way to undo
> those changes. For this problem, we need to implement some protocol in
> the WAL sender where it sends the log to the consumer only up to the
> flush LSN of the standby/quorum replicas. This is something I am
> working on right now.