[ 
https://issues.apache.org/jira/browse/HBASE-29027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yutong Xiao updated HBASE-29027:
--------------------------------
    Description: 
Under the current replication implementation, all WAL files are consumed, 
and the data is then filtered by the REPLICATION_SCOPE_GLOBAL attribute.
 
We run version 2.4.11 in our production environment. A small group of tables 
uses replication for CDC. However, as the number of other, non-replicated 
tables grew, we hit a log queue accumulation issue and the data lag grew to 
the order of hours. 
 
To solve the issue, we write the tables that have a CF with replication_scope 
set to global to separate WAL files and let replication consume only those 
files. The feature solved our problem, so we are contributing it to the 
community. 
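
The routing idea described above can be illustrated with a minimal sketch. 
This is not the code in the patch: the Table class, the WAL group names 
"replication" and "default", and the example table names are all hypothetical. 
Only the scope values (local = 0, global = 1) mirror the HBase constants.

```python
from dataclasses import dataclass, field

# Mirror the values of HBase's replication scope constants.
REPLICATION_SCOPE_LOCAL = 0
REPLICATION_SCOPE_GLOBAL = 1


@dataclass
class Table:
    """Hypothetical stand-in for a table descriptor: CF name -> replication scope."""
    name: str
    cf_scopes: dict = field(default_factory=dict)

    def has_global_scope(self) -> bool:
        # A table counts as replicated if any of its CFs is marked global.
        return any(s == REPLICATION_SCOPE_GLOBAL for s in self.cf_scopes.values())


def wal_group_for(table: Table) -> str:
    """Pick the (hypothetical) WAL group that a table's edits are written to."""
    return "replication" if table.has_global_scope() else "default"


def split_by_wal_group(tables):
    """Group table names by the WAL group their edits would land in."""
    groups = {"replication": [], "default": []}
    for t in tables:
        groups[wal_group_for(t)].append(t.name)
    return groups


# Hypothetical example tables, for illustration only.
tables = [
    Table("orders", {"d": REPLICATION_SCOPE_GLOBAL}),
    Table("metrics", {"d": REPLICATION_SCOPE_LOCAL}),
    Table("audit", {"meta": REPLICATION_SCOPE_LOCAL, "log": REPLICATION_SCOPE_GLOBAL}),
]
groups = split_by_wal_group(tables)
print(groups)
```

In this sketch, replication would only need to read the files of the 
"replication" group, so writes to the other tables no longer add to its queue.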

In our environment, we have 19 tables and only 8 of them need to be replicated. 

Some of the remaining 11 tables are write-heavy, so our business benefits a 
lot from this feature.

 
The attached screenshot shows the improvement.


 

  was:
Under the current replication implementation, all WAL files are consumed, 
and the data is then filtered by the REPLICATION_SCOPE_GLOBAL attribute.
 
We run version 2.4.11 in our production environment. A small group of tables 
uses replication for CDC. However, as the number of other, non-replicated 
tables grew, we hit a log queue accumulation issue and the data lag grew to 
the order of hours. 
 
To solve the issue, we write the tables that have a CF with replication_scope 
set to global to separate WAL files and let replication consume only those 
files. The feature solved our problem, so we are contributing it to the 
community. 
 
 
 


> Introduce a new WALProvider to generate WAL files consumed by Replication
> -------------------------------------------------------------------------
>
>                 Key: HBASE-29027
>                 URL: https://issues.apache.org/jira/browse/HBASE-29027
>             Project: HBase
>          Issue Type: Improvement
>          Components: Replication, wal
>            Reporter: Yutong Xiao
>            Assignee: Yutong Xiao
>            Priority: Major
>              Labels: pull-request-available
>         Attachments: Screenshot 2024-12-11 at 17.46.40.png
>
>
> Under the current replication implementation, all WAL files are consumed, 
> and the data is then filtered by the REPLICATION_SCOPE_GLOBAL attribute.
>  
> We run version 2.4.11 in our production environment. A small group of tables 
> uses replication for CDC. However, as the number of other, non-replicated 
> tables grew, we hit a log queue accumulation issue and the data lag grew to 
> the order of hours. 
>  
> To solve the issue, we write the tables that have a CF with replication_scope 
> set to global to separate WAL files and let replication consume only those 
> files. The feature solved our problem, so we are contributing it to the 
> community. 
> In our environment, we have 19 tables and only 8 of them need to be 
> replicated. Some of the remaining 11 tables are write-heavy, so our business 
> benefits a lot from this feature.
>  
> The attached screenshot shows the improvement.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
