[ 
https://issues.apache.org/jira/browse/HBASE-12988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14590320#comment-14590320
 ] 

Lars Hofhansl commented on HBASE-12988:
---------------------------------------

Any comments on this:
{code}
int n = Math.min(this.maxThreads, replicationSinkMgr.getSinks().size()/100+1);
{code}?

I'm trying to avoid breaking up very small batches. I.e. if the batch has only 
50 edits to ship it does not make much sense to break that up into 5 chunks of 
1 edit. Even when these edits are large... In that case the network time will 
dominate and breaking up won't add much.
So I pulled 100 out of my hat.

Also maybe I should add a Math.min(..., number of sinks), so that we are not 
using more threads than we have sinks region servers on the other side.

> [Replication]Parallel apply edits on row-level
> ----------------------------------------------
>
>                 Key: HBASE-12988
>                 URL: https://issues.apache.org/jira/browse/HBASE-12988
>             Project: HBase
>          Issue Type: Improvement
>          Components: Replication
>            Reporter: hongyu bi
>            Assignee: Lars Hofhansl
>         Attachments: 12988.txt, HBASE-12988-0.98.patch, 
> ParallelReplication-v2.txt
>
>
> we can apply  edits to slave cluster in parallel on table-level to speed up 
> replication .
> update : per conversation blow , it's better to apply edits on row-level in 
> parallel



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to