[ 
https://issues.apache.org/jira/browse/HBASE-2129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12800727#action_12800727
 ] 

Jean-Daniel Cryans commented on HBASE-2129:
-------------------------------------------

bq. Are there any strategies to parallelize this operation for busy clusters?

Yes, this is one of the things that I scaled down in order to present a patch 
sooner. Currently the ReplicationSink cannot fall behind the edits coming from 
the master cluster, since it blocks the replication of new entries. The next 
step is to allow appends while the file is being read (that is, while logs are 
replayed) and then, once parallelized puts (HBASE-2066) are available, to use 
them in that thread.

The plan for the logs themselves in ReplicationSink is to let the thread fall 
behind on replaying edits, for example because the slave cluster is very busy, 
and, if the region server dies, to use those logs to replay the edits 
somewhere else.
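
To make the idea concrete, here is a rough sketch of a sink that accepts new 
edits without waiting for the replay thread. It is not code from the patch: 
the class BufferedSinkSketch and its methods are made up for illustration, 
edits are plain strings, and an in-memory queue stands in for the log file, so 
the recovery of logs after a region server death is not covered.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.TimeUnit;

// Hypothetical sketch only; this is NOT the ReplicationSink from HBASE-2129.
class BufferedSinkSketch {
    // Stand-in for the sink-side log: appenders add, the replayer drains.
    private final BlockingQueue<String> pending = new LinkedBlockingQueue<>();
    // Stand-in for the slave cluster's tables.
    private final List<String> applied = new ArrayList<>();
    private volatile boolean running = true;
    private final Thread replayer;

    BufferedSinkSketch() {
        replayer = new Thread(this::replayLoop, "replayer");
        replayer.start();
    }

    // Called for each incoming edit; returns without waiting for replay.
    void append(String edit) {
        pending.add(edit);
    }

    // Replays edits in arrival order; may lag behind the appenders.
    private void replayLoop() {
        try {
            while (running || !pending.isEmpty()) {
                String edit = pending.poll(50, TimeUnit.MILLISECONDS);
                if (edit != null) {
                    synchronized (applied) {
                        applied.add(edit);
                    }
                }
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    // Stops accepting work, waits for the backlog to drain, returns a copy.
    List<String> shutdownAndGetApplied() {
        running = false;
        try {
            replayer.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        synchronized (applied) {
            return new ArrayList<>(applied);
        }
    }
}
```

A single replay thread keeps the edits in order; using parallelized puts in 
that thread would change how each batch is applied, not how edits are queued.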

> Simple Master/Slave replication
> -------------------------------
>
>                 Key: HBASE-2129
>                 URL: https://issues.apache.org/jira/browse/HBASE-2129
>             Project: Hadoop HBase
>          Issue Type: Sub-task
>            Reporter: Jean-Daniel Cryans
>            Assignee: Jean-Daniel Cryans
>             Fix For: 0.21.0
>
>         Attachments: HBASE-2129.patch
>
>
> We should first build a very simple replication mechanism to validate our 
> assumptions and get a feel of what replication is in this very distributed 
> context. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
