[ 
https://issues.apache.org/jira/browse/HBASE-19106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16225942#comment-16225942
 ] 

Ted Yu commented on HBASE-19106:
--------------------------------

The amount of data in the source tables may be big.
Where should the validation job restore to ?

Considering that restore job may take non-trivial duration, how should the 
result of restore be delivered ?

Thanks

> Backup self validation for its correctness.
> -------------------------------------------
>
>                 Key: HBASE-19106
>                 URL: https://issues.apache.org/jira/browse/HBASE-19106
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Amit Kabra
>
> Backups are critical and if they don't work when we need them at the time of 
> restore than they are not useful. We should do sanity test for each backup 
> job we run that it is restorable and hence can be trusted.
> A self validation feature can be added for the same to the backups where 
> whenever a backup is run , once it finishes it will trigger a validation job 
> that will do a sample restoration of the backed up data and will make sure 
> that it compares well with actual data.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to