[ 
https://issues.apache.org/jira/browse/HBASE-14417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15452824#comment-15452824
 ] 

Ted Yu commented on HBASE-14417:
--------------------------------

bq. Delete file ref from hbase:backup after backup completes

Did you mean after restore completes?
What if the user wants to restore to a different destination afterward?
Removal of the hfile ref from hbase:backup can instead be coupled with the 
deletion of the backup(s).

Restoring an incremental backup would ship hfiles along with WAL files to the 
destination.

Consider this sequence of events, where f means full backup, b means bulk 
load and i means incremental backup:
* f
* b1 (with hfile1)
* b2 (with hfile2)
* i1
* b3 (with hfile3)
* i2

The design of the hfile ref in hbase:backup should make the BackupHFileCleaner 
operation efficient.
If we consolidate the hfile refs from b1 and b2 into i1, and from b3 into i2, 
BackupHFileCleaner needs to search backward (across all outstanding incremental 
backups): i2 -> i1 -> f.
If we instead consolidate the hfile refs from b1 and b2 by merging them and 
storing the result as b1', there is no need to search the incremental backup(s).
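
To make the difference between the two layouts concrete, here is a minimal sketch in Python. It is only an in-memory model of the event sequence above, not HBase code: every name in it (EVENTS, refs_per_backup, is_referenced_chain, merged_refs, is_referenced_merged) is hypothetical, and it simplifies the second layout by assuming all bulk-load refs are merged into one set.

```python
# Hypothetical in-memory model of the two hfile-ref layouts discussed above.
# None of these names come from the HBase code base.

# The event sequence from the comment: (event name, bulk-loaded hfile or None).
EVENTS = [
    ("f", None),
    ("b1", "hfile1"),
    ("b2", "hfile2"),
    ("i1", None),
    ("b3", "hfile3"),
    ("i2", None),
]


def refs_per_backup(events):
    """Layout A: attach pending bulk-load refs to the next backup taken."""
    chain = []       # (backup id, set of hfiles), oldest first
    pending = set()  # bulk-loaded hfiles not yet covered by a backup
    for name, hfile in events:
        if name.startswith("b"):
            pending.add(hfile)
        else:  # full ("f") or incremental ("i...") backup
            chain.append((name, pending))
            pending = set()
    return chain


def is_referenced_chain(chain, hfile):
    """Search backward through the backups; return (found, backups visited)."""
    visited = []
    for backup_id, hfiles in reversed(chain):
        visited.append(backup_id)
        if hfile in hfiles:
            return True, visited
    return False, visited


def merged_refs(events):
    """Layout B (simplified): merge every bulk-load ref into a single set."""
    return {hfile for name, hfile in events if name.startswith("b")}


def is_referenced_merged(merged, hfile):
    """A single membership test, independent of how many backups exist."""
    return hfile in merged


if __name__ == "__main__":
    chain = refs_per_backup(EVENTS)
    print(is_referenced_chain(chain, "hfile1"))
    print(is_referenced_chain(chain, "hfile9"))
    print(is_referenced_merged(merged_refs(EVENTS), "hfile1"))
```

In this model, a lookup under the first layout visits more backups the older the hfile is, and a miss visits all of them, while the merged layout answers with one set lookup. How the merged record would actually be stored in hbase:backup is left open here.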

> Incremental backup and bulk loading
> -----------------------------------
>
>                 Key: HBASE-14417
>                 URL: https://issues.apache.org/jira/browse/HBASE-14417
>             Project: HBase
>          Issue Type: New Feature
>    Affects Versions: 2.0.0
>            Reporter: Vladimir Rodionov
>            Assignee: Vladimir Rodionov
>            Priority: Critical
>              Labels: backup
>             Fix For: 2.0.0
>
>
> Currently, incremental backup is based on WAL files. Bulk data loading 
> bypasses WALs for obvious reasons, breaking incremental backups. The only way 
> to continue backups after bulk loading is to create new full backup of a 
> table. This may not be feasible for customers who do bulk loading regularly 
> (say, every day).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
