[jira] [Comment Edited] (HBASE-29519) Copy Bulkloaded Files in Continuous Backup

Tak-Lon (Stephen) Wu (Jira) Wed, 13 Aug 2025 12:50:27 -0700


    [ 
https://issues.apache.org/jira/browse/HBASE-29519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18013771#comment-18013771
 ]


Tak-Lon (Stephen) Wu edited comment on HBASE-29519 at 8/13/25 7:49 PM:
-----------------------------------------------------------------------

I reviewed the current PR, and I wonder whether this blocking upload may slow 
down the replication of WAL edits. Either an asynchronous upload, or an upload 
performed after the WAL is closed, could solve the blocking issue. 
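
To make the asynchronous option concrete, here is a minimal sketch. It assumes 
the copy itself can be expressed as a callback; the class name 
AsyncBulkLoadUploader, its methods, and the copier callback are all 
hypothetical and are not taken from the PR or from any HBase API. The 
replication path only schedules the copy, and the commit step waits for the 
outstanding copies and learns whether any of them failed.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CompletionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Consumer;

/** Hypothetical sketch only; none of these names exist in HBase. */
class AsyncBulkLoadUploader implements AutoCloseable {
  private final ExecutorService pool;
  // Stands in for the real copy of one HFile to the backup location.
  private final Consumer<String> copier;
  private final List<CompletableFuture<Void>> pending = new ArrayList<>();

  AsyncBulkLoadUploader(int threads, Consumer<String> copier) {
    this.pool = Executors.newFixedThreadPool(threads);
    this.copier = copier;
  }

  /** Schedules the copy and returns immediately, so WAL replication is not blocked. */
  synchronized void submit(String hfilePath) {
    pending.add(CompletableFuture.runAsync(() -> copier.accept(hfilePath), pool));
  }

  /** Waits for every scheduled copy; returns false if at least one failed. */
  synchronized boolean awaitAll() {
    boolean allSucceeded = true;
    for (CompletableFuture<Void> f : pending) {
      try {
        f.join();
      } catch (CompletionException e) {
        allSucceeded = false;
      }
    }
    pending.clear();
    return allSucceeded;
  }

  @Override
  public void close() {
    pool.shutdown();
  }
}
```

In this sketch a false result from awaitAll() is the point where the caller 
would have to decide between retrying the failed copies and not committing the 
WAL, which is the question raised below.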

Can you explain how we will resume the upload of bulkloaded files if it 
fails? 

You mentioned an in-memory structure that is used while the WAL file is 
closing (not yet closed, during the COMMIT stage). If the bulkloaded files 
fail to upload, are we going to abandon this WAL file and resync/resume from 
where the WAL replication started? The concern is the replication queue: could 
this cause another WAL backlog cascade? And whether the list is kept in memory 
or stored in a persistent table, what if the list of bulkloaded files is huge?



was (Author: taklwu):
I reviewed the current PR, and I wonder whether this blocking upload may slow 
down the replication of WAL edits. Either an asynchronous upload, or an upload 
performed after the WAL is closed, could solve the blocking issue.

Can you explain how we will resume the upload of bulkloaded files if it 
fails? You mentioned an in-memory structure that is used while the WAL file is 
closing (not yet closed, during the COMMIT stage). If the bulkloaded files 
fail to upload, are we going to abandon this WAL file and resync/resume from 
where the WAL replication started? The concern is the replication queue: could 
this cause another WAL backlog cascade? 

> Copy Bulkloaded Files in Continuous Backup
> ------------------------------------------
>
>                 Key: HBASE-29519
>                 URL: https://issues.apache.org/jira/browse/HBASE-29519
>             Project: HBase
>          Issue Type: Sub-task
>          Components: backup&restore
>            Reporter: Vinayak Hegde
>            Assignee: Vinayak Hegde
>            Priority: Major
>              Labels: pull-request-available
>
> Enhance the continuous backup replication endpoint to detect bulkload 
> operations and copy their HFiles to the backup location (e.g., S3). 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
