[ https://issues.apache.org/jira/browse/HBASE-21394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16694796#comment-16694796 ]
Zheng Hu commented on HBASE-21394:
----------------------------------

Take a careful look at RestoreSnapshotHelper#restoreHdfsRegions: parallel snapshot restoring is already implemented there.
{code}
public RestoreMetaChanges restoreHdfsRegions() throws IOException {
  ThreadPoolExecutor exec = SnapshotManifest.createExecutor(conf, "RestoreSnapshot");
  try {
    return restoreHdfsRegions(exec);
  } finally {
    exec.shutdown();
  }
}
{code}
I think the TODO comment in RestoreSnapshotHelper#copySnapshotForScanner can be removed now.
{code}
RestoreMetaChanges metaChanges = helper.restoreHdfsRegions(); // TODO: parallelize.
{code}

> Restore snapshot in parallel
> ----------------------------
>
>                 Key: HBASE-21394
>                 URL: https://issues.apache.org/jira/browse/HBASE-21394
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Zheng Hu
>            Assignee: Zheng Hu
>            Priority: Major
>             Fix For: 3.0.0, 2.2.0, 2.1.2
>
>
> Our MapReduce/Spark job is highly dependent on SnapshotScanner. When restoring
> a big table for SnapshotScanner, it takes hours.
> Restoring the snapshot in parallel will help a lot.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
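
For readers unfamiliar with the executor pattern in the restoreHdfsRegions() snippet quoted in the comment above, here is a minimal, self-contained sketch of the same shape using only java.util.concurrent. It is not HBase code: the class ParallelRestoreSketch, the methods restoreRegion and restoreAll, and the string region names are all hypothetical stand-ins, and the real per-region work and the pool created by SnapshotManifest.createExecutor may well differ.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ParallelRestoreSketch {

    // Hypothetical stand-in for the per-region restore work.
    static String restoreRegion(String regionName) {
        return "restored:" + regionName;
    }

    // Submits one task per region, then waits for all of them.
    // Results are collected in the same order as the input list.
    static List<String> restoreAll(List<String> regions, int threads)
            throws InterruptedException, ExecutionException {
        ExecutorService exec = Executors.newFixedThreadPool(threads);
        try {
            List<Future<String>> futures = new ArrayList<>();
            for (String region : regions) {
                Callable<String> task = () -> restoreRegion(region);
                futures.add(exec.submit(task));
            }
            List<String> results = new ArrayList<>();
            for (Future<String> f : futures) {
                // get() blocks until the task finishes and rethrows a task
                // failure wrapped in an ExecutionException.
                results.add(f.get());
            }
            return results;
        } finally {
            // Same try/finally shape as the quoted restoreHdfsRegions():
            // the pool is always shut down, even if a task fails.
            exec.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        List<String> out = restoreAll(Arrays.asList("region-a", "region-b", "region-c"), 2);
        System.out.println(out);
    }
}
```

The point of the sketch is only that the caller owns the pool for the duration of one restore and releases it in a finally block, so a caller such as copySnapshotForScanner gets the parallelism without managing threads itself.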