[
https://issues.apache.org/jira/browse/HBASE-21539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16705764#comment-16705764
]
Duo Zhang commented on HBASE-21539:
-----------------------------------
[~zghaobac] FYI.
> Should add backoff when replaying failed in SyncReplicationReplayWALProcedure
> -----------------------------------------------------------------------------
>
> Key: HBASE-21539
> URL: https://issues.apache.org/jira/browse/HBASE-21539
> Project: HBase
> Issue Type: Sub-task
> Reporter: Duo Zhang
> Priority: Major
>
> I'm still testing serial and sync replication, and it is stuck again...
> I still need to find the root cause, but there is another problem. Since
> replication is stuck, we have lots of WALs to replay, which puts too much
> pressure on the memstore, so the region rejects the write requests and the
> SyncReplicationReplayWALRemoteProcedure fails. But we then immediately
> schedule a new SyncReplicationReplayWALRemoteProcedure without any sleep,
> which means we keep adding pressure to the memstore. The result is clear:
> we cannot finish the replay, we write too much duplicated data to the
> region, and we cannot recover any more...
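
For illustration only, the kind of backoff the title asks for could look like the sketch below. It is a minimal, capped exponential backoff, not the actual fix: the class ReplayBackoff, its method delayMillis, and the base/max values are all hypothetical names and numbers chosen for this example, and nothing here is taken from the HBase code base.

```java
// Hypothetical helper, not part of HBase: computes how long to wait before
// rescheduling a replay attempt that has just failed.
final class ReplayBackoff {
  private final long baseMillis; // delay before the first retry (assumed value)
  private final long maxMillis;  // upper bound on any single delay (assumed value)

  ReplayBackoff(long baseMillis, long maxMillis) {
    if (baseMillis <= 0 || maxMillis < baseMillis) {
      throw new IllegalArgumentException("need 0 < baseMillis <= maxMillis");
    }
    this.baseMillis = baseMillis;
    this.maxMillis = maxMillis;
  }

  // attempt is 0 for the first retry. The delay doubles with every further
  // attempt and never exceeds maxMillis.
  long delayMillis(int attempt) {
    if (attempt < 0) {
      throw new IllegalArgumentException("attempt must be >= 0");
    }
    long delay = baseMillis;
    for (int i = 0; i < attempt && delay < maxMillis; i++) {
      // Compare against maxMillis / 2 first so the doubling cannot overflow.
      delay = (delay > maxMillis / 2) ? maxMillis : delay * 2;
    }
    return delay;
  }
}
```

With a base of 1 second and a cap of 60 seconds, successive failures would wait 1s, 2s, 4s, and so on up to 60s before the next replay is scheduled, which gives the memstore time to flush instead of receiving the same writes again right away.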
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)