[jira] [Commented] (HBASE-21394) Restore snapshot in parallel

Zheng Hu (JIRA) Wed, 21 Nov 2018 06:49:14 -0800


    [ 
https://issues.apache.org/jira/browse/HBASE-21394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16694796#comment-16694796
 ]


Zheng Hu commented on HBASE-21394:
----------------------------------

Take a look the RestoreSnapshotHelper#restoreHdfsRegions carefully,   we  have 
implemented the parallel snapshot restoring. 
{code}
public RestoreMetaChanges restoreHdfsRegions() throws IOException {
    ThreadPoolExecutor exec = SnapshotManifest.createExecutor(conf, 
"RestoreSnapshot");
    try {
      return restoreHdfsRegions(exec);
    } finally {
      exec.shutdown();
    }
  }
{code}

I think the TODO comment in RestoreSnapshotHelper#copySnapshotForScanner can be 
removed now. 
{code}
  RestoreMetaChanges metaChanges = helper.restoreHdfsRegions(); // TODO: 
parallelize.
{code}

> Restore snapshot in parallel
> ----------------------------
>
>                 Key: HBASE-21394
>                 URL: https://issues.apache.org/jira/browse/HBASE-21394
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Zheng Hu
>            Assignee: Zheng Hu
>            Priority: Major
>             Fix For: 3.0.0, 2.2.0, 2.1.2
>
>
> Our MapReduce/Spark job is highly dependent on SnapshotScanner.  When restore 
> a big table for SnapshotScanner,  it'll take hours ..
> Restore snapshot in parallel will helps a lot. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (HBASE-21394) Restore snapshot in parallel

Reply via email to