[
https://issues.apache.org/jira/browse/HBASE-2129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12800727#action_12800727
]
Jean-Daniel Cryans commented on HBASE-2129:
-------------------------------------------
bq. Are there any strategies to parallelize this operation for busy clusters?
Yes, this is one of the things that I scaled down in order to present a patch
sooner. Currently the ReplicationSink cannot get behind edits coming from the
master cluster since its blocking the replication of new entries. The next step
is to allow appenders while the file is being read (logs replayed), then once
parallelized (HBASE-2066) puts are available to use them in that thread.
The plan for the logs themselves in ReplicationSink is to be able to let the
thread get behind on replaying edits, for example because the slave cluster is
very busy, and that if the region server dies to be able to use those logs to
replay them somewhere else.
> Simple Master/Slave replication
> -------------------------------
>
> Key: HBASE-2129
> URL: https://issues.apache.org/jira/browse/HBASE-2129
> Project: Hadoop HBase
> Issue Type: Sub-task
> Reporter: Jean-Daniel Cryans
> Assignee: Jean-Daniel Cryans
> Fix For: 0.21.0
>
> Attachments: HBASE-2129.patch
>
>
> We should first build a very simple replication mechanism to validate our
> assumptions and get a feel of what replication is in this very distributed
> context.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.