[
https://issues.apache.org/jira/browse/HBASE-28850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17882776#comment-17882776
]
Andrew Kyle Purtell edited comment on HBASE-28850 at 9/18/24 9:43 PM:
----------------------------------------------------------------------
It makes sense not to return "success" to the source unless all edits in the
batch are confirmed to be completed.
The trade off is this may change the performance of replicateEntries. Before,
the replicateEntries call will return at the first failure. After, the
replicateEntries will block for the entire time it takes to confirm all local
edits in the batch are applied or failed.
was (Author: apurtell):
It can be a reasonable design decision to not return "success" to the source
unless all edits in the batch are confirmed to be applied. It would make
failure handling more reliable because we may miss local exceptions, and return
"success" back to the source prematurely, if we are not waiting for all futures
to complete.
It may change the performance of replicateEntries. Before, the replicateEntries
call will return as soon as the local edits on the sink are _scheduled_ for
application. After, the replicateEntries will block for the entire time it
takes to confirm all local edits in the batch are applied, including waiting
for retries and backoff if there is local server unavailability. And that may
cause a reduction in replication throughput because of the new backpressure on
the source, as we block in replicateEntries for what I assume will be a longer
time on average.
> Only return from ReplicationSink.replicationEntries while all background
> tasks are finished
> -------------------------------------------------------------------------------------------
>
> Key: HBASE-28850
> URL: https://issues.apache.org/jira/browse/HBASE-28850
> Project: HBase
> Issue Type: Improvement
> Components: Replication, rpc
> Reporter: Duo Zhang
> Assignee: Duo Zhang
> Priority: Major
> Labels: pull-request-available
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)