[ 
https://issues.apache.org/jira/browse/HBASE-12988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541365#comment-14541365
 ] 

Lars Hofhansl commented on HBASE-12988:
---------------------------------------

Looking into this. This is a bit more intricate than expected. An Entry in the 
WAL is a WALKey/WALEdit pair. The WALKey has (among other stuff) the table and 
region name; the WALEdit has the Cells, which in turn have the row. A WALEdit 
can contain Cells for many rows.

So to group by row we would need to pull WALEdits apart. I'd have to think 
through the implications to convince myself that that's OK. On top of that, after 
the row-grouping we'd now have to build _new_ WALEdits, since they'd have a 
different set of Cells. Ugh...
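
To make the row-grouping cost concrete, here is a rough sketch using simplified stand-in types. Cell, Edit and WalEntry below are hypothetical classes invented for illustration, not the real HBase WALKey/WALEdit/Cell classes, and keying groups by table plus row is an assumption. It only shows why each Edit has to be pulled apart and rebuilt per row.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical, simplified stand-ins for the WAL types; NOT the real HBase classes.
final class RowGroupingSketch {
    static final class Cell {
        final String row;
        final String value;
        Cell(String row, String value) { this.row = row; this.value = value; }
    }

    // Stands in for a WALEdit: a list of cells that may span many rows.
    static final class Edit {
        final List<Cell> cells;
        Edit(List<Cell> cells) { this.cells = cells; }
    }

    // Stands in for a WAL entry: key information (table, region) plus the edit.
    static final class WalEntry {
        final String table;
        final String region;
        final Edit edit;
        WalEntry(String table, String region, Edit edit) {
            this.table = table;
            this.region = region;
            this.edit = edit;
        }
    }

    // Groups by "table/row". Each original Edit is split, and a new Edit is
    // built for every row it touched, because the set of cells changes.
    static Map<String, List<WalEntry>> groupByRow(List<WalEntry> entries) {
        Map<String, List<WalEntry>> byRow = new LinkedHashMap<>();
        for (WalEntry e : entries) {
            // Pull the edit apart: collect this entry's cells per row.
            Map<String, List<Cell>> cellsPerRow = new LinkedHashMap<>();
            for (Cell c : e.edit.cells) {
                cellsPerRow.computeIfAbsent(c.row, r -> new ArrayList<>()).add(c);
            }
            // Rebuild one new entry per row, reusing the original key fields.
            for (Map.Entry<String, List<Cell>> m : cellsPerRow.entrySet()) {
                String groupKey = e.table + "/" + m.getKey();
                byRow.computeIfAbsent(groupKey, k -> new ArrayList<>())
                     .add(new WalEntry(e.table, e.region, new Edit(m.getValue())));
            }
        }
        return byRow;
    }
}
```

Note that one original entry can end up in several groups, which is exactly the part whose implications need thinking through.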

Grouping entire WALEdits on the other hand is easier, but those I can only 
group by table, and - as discussed above - that might not be effective in many 
scenarios.
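
For comparison, a minimal sketch of grouping whole entries by table, again with a hypothetical stand-in WalEntry type rather than the real HBase classes. Nothing is split or rebuilt, but if all the traffic is for one table there is only one group and so nothing to run in parallel.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in type; NOT the real HBase WAL entry class.
final class TableGroupingSketch {
    static final class WalEntry {
        final String table;
        final String payload; // stands in for the untouched WALEdit
        WalEntry(String table, String payload) {
            this.table = table;
            this.payload = payload;
        }
    }

    // Entries are kept whole and in their original order within each table.
    static Map<String, List<WalEntry>> groupByTable(List<WalEntry> entries) {
        Map<String, List<WalEntry>> byTable = new LinkedHashMap<>();
        for (WalEntry e : entries) {
            byTable.computeIfAbsent(e.table, t -> new ArrayList<>()).add(e);
        }
        return byTable;
    }
}
```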

I can easily do a size based breakup and then rely on the fact that we keep 
deletes around for a little bit, but that is brittle... I do have a test patch 
for this.
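
To illustrate what a size-based breakup could look like (this is not the test patch, just a minimal sketch with hypothetical names and a made-up WalEntry type): entries are cut into consecutive batches that stay under a byte limit. If such batches are then shipped in parallel, ordering between batches is no longer guaranteed, which is where the reliance on deletes being kept around comes in.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in type and method names; NOT the real HBase classes.
final class SizeBatchingSketch {
    static final class WalEntry {
        final String id;
        final long sizeBytes; // assumed to be an estimate of the entry's size
        WalEntry(String id, long sizeBytes) {
            this.id = id;
            this.sizeBytes = sizeBytes;
        }
    }

    // Cuts the list into consecutive batches of at most maxBatchBytes each.
    // An entry larger than the limit still gets a batch of its own.
    static List<List<WalEntry>> splitBySize(List<WalEntry> entries, long maxBatchBytes) {
        List<List<WalEntry>> batches = new ArrayList<>();
        List<WalEntry> current = new ArrayList<>();
        long currentBytes = 0;
        for (WalEntry e : entries) {
            // Close the current batch if adding this entry would exceed the limit.
            if (!current.isEmpty() && currentBytes + e.sizeBytes > maxBatchBytes) {
                batches.add(current);
                current = new ArrayList<>();
                currentBytes = 0;
            }
            current.add(e);
            currentBytes += e.sizeBytes;
        }
        if (!current.isEmpty()) {
            batches.add(current);
        }
        return batches;
    }
}
```

Entries keep their original order inside a batch; only the order in which whole batches get applied would be up for grabs.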


> [Replication]Parallel apply edits on row-level
> ----------------------------------------------
>
>                 Key: HBASE-12988
>                 URL: https://issues.apache.org/jira/browse/HBASE-12988
>             Project: HBase
>          Issue Type: Improvement
>          Components: Replication
>            Reporter: hongyu bi
>            Assignee: hongyu bi
>
> We can apply edits to the slave cluster in parallel at the table level to speed up 
> replication.
> Update: per the conversation below, it's better to apply edits at the row level in 
> parallel.



