[jira] [Commented] (HBASE-15331) HBase Backup/Restore Phase 2: Optimized Restore operation

Vladimir Rodionov (JIRA) Fri, 11 Mar 2016 11:00:50 -0800

    [ 
https://issues.apache.org/jira/browse/HBASE-15331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15191388#comment-15191388
 ]


Vladimir Rodionov commented on HBASE-15331:
-------------------------------------------

Partially resolved (in HBASE-14123-v11 patch). 

Now we restore all intermediate incremental images in a single M/R job instead 
of multiple (one per image) and we use bulk load and nor streaming puts. 





> HBase Backup/Restore Phase 2: Optimized Restore operation
> ---------------------------------------------------------
>
>                 Key: HBASE-15331
>                 URL: https://issues.apache.org/jira/browse/HBASE-15331
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Vladimir Rodionov
>            Assignee: Vladimir Rodionov
>
> The current implementation for restore uses WALReplay M/R job. This has 
> performance and stability problems, since it uses HBase client API to insert 
> data. We have to migrate to bulk load approach: generate hfiles directly from 
> snapshot and incremental images. We run separate M/R job for every backup 
> image between last FULL backup and current incremental backup we restore to 
> and for every table in the list (image). If we have 10 tables and 30 days of 
> incremental backup images - this results in 30x10 = 300 M/R jobs. MUST be 
> optimized.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-15331) HBase Backup/Restore Phase 2: Optimized Restore operation

Reply via email to