[
https://issues.apache.org/jira/browse/HBASE-19106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16226521#comment-16226521
]
Amit Kabra commented on HBASE-19106:
------------------------------------
The validation job should restore the data to temp table which will get deleted
once the validation job finishes.
Also we don't have to restore complete backup but we can restore some part of
it, may be 1 out of x hfiles from backups or portion of each hfile, etc.
Next level of this could be to provide validation percentage from command line
to backups eg 10% , in that case after backups 10% of backups would be
validated and only if validation passes that we call backups are successful,
else we fail the backups.
Overall, idea is to somehow check if the backups are restorable and backup
taken are the correct backups.
> Backup self validation for its correctness.
> -------------------------------------------
>
> Key: HBASE-19106
> URL: https://issues.apache.org/jira/browse/HBASE-19106
> Project: HBase
> Issue Type: Improvement
> Reporter: Amit Kabra
>
> Backups are critical and if they don't work when we need them at the time of
> restore than they are not useful. We should do sanity test for each backup
> job we run that it is restorable and hence can be trusted.
> A self validation feature can be added for the same to the backups where
> whenever a backup is run , once it finishes it will trigger a validation job
> that will do a sample restoration of the backed up data and will make sure
> that it compares well with actual data.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)