Duo Zhang created HBASE-21539:
---------------------------------

             Summary: Should add backoff when replaying failed in 
SyncReplicationReplayWALProcedure
                 Key: HBASE-21539
                 URL: https://issues.apache.org/jira/browse/HBASE-21539
             Project: HBase
          Issue Type: Sub-task
            Reporter: Duo Zhang


I'm still testing serial and sync replication, and it is stuck again...

I still need to find the root cause, but there is another problem. Because
replication is stuck, we have lots of WALs to replay, which puts too much
pressure on the memstore. The region then rejects the write requests, so the
SyncReplicationReplayWALRemoteProcedure fails. But we immediately schedule a
new SyncReplicationReplayWALRemoteProcedure without any sleep, which means we
keep adding pressure to the memstore. The result is clear: we cannot finish
the replay, we write too much duplicated data to the region, and we cannot
recover.
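
A minimal sketch of the kind of backoff meant here, assuming a capped
exponential delay between replay attempts. ReplayBackoff, backoffMillis,
runWithBackoff and the sleeper parameter are hypothetical names for
illustration only; this is not the HBase procedure framework API and not
necessarily how the fix will be implemented.

```java
import java.util.function.BooleanSupplier;
import java.util.function.LongConsumer;

/** Hypothetical helper, not part of HBase: capped exponential backoff for replay retries. */
public final class ReplayBackoff {
  private ReplayBackoff() {}

  /**
   * Delay before the retry that follows failed attempt number `attempt` (0-based):
   * baseMillis * 2^attempt, capped at maxMillis.
   */
  public static long backoffMillis(int attempt, long baseMillis, long maxMillis) {
    if (attempt < 0 || baseMillis <= 0 || maxMillis < baseMillis) {
      throw new IllegalArgumentException("invalid backoff parameters");
    }
    long delay = baseMillis;
    // Double step by step and stop at the cap, so the value can never overflow.
    for (int i = 0; i < attempt && delay < maxMillis; i++) {
      delay = delay > maxMillis / 2 ? maxMillis : delay * 2;
    }
    return Math.min(delay, maxMillis);
  }

  /**
   * Runs replayOnce until it reports success or maxAttempts is reached, sleeping
   * between failed attempts so the memstore gets time to flush.
   *
   * @param replayOnce stand-in for one replay attempt; returns true on success
   * @param sleeper    receives the delay in milliseconds (e.g. a Thread.sleep wrapper)
   * @return the number of attempts used, or -1 if every attempt failed
   */
  public static int runWithBackoff(BooleanSupplier replayOnce, int maxAttempts,
      long baseMillis, long maxMillis, LongConsumer sleeper) {
    for (int attempt = 0; attempt < maxAttempts; attempt++) {
      if (replayOnce.getAsBoolean()) {
        return attempt + 1;
      }
      // No point in sleeping after the final failed attempt.
      if (attempt + 1 < maxAttempts) {
        sleeper.accept(backoffMillis(attempt, baseMillis, maxMillis));
      }
    }
    return -1;
  }
}
```

With a base of 1000 ms and a cap of 60000 ms, the delays after successive
failures would be 1000, 2000, 4000, ... up to 60000 ms, instead of
rescheduling immediately. In a real procedure the wait would presumably be
expressed by suspending the procedure with a timeout rather than by blocking
a worker thread.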



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
