[
https://issues.apache.org/jira/browse/HBASE-21394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17063006#comment-17063006
]
Xu Cang commented on HBASE-21394:
---------------------------------
While debugging a snapshot-related issue, I found this JIRA.
From my observation, the method RestoreSnapshotHelper#restoreHdfsRegions()
will always try to iterate over all regions and open all hfiles for the table
in every mapper.
So suppose we have 500 mappers scanning a snapshot of the table: all 500
mappers iterate over all regions/hfiles. Is this the same symptom you saw, and
is it by design? (BTW, I am using branch-1 code and haven't tried this parallel
improvement.)
[~openinx]
Thanks!
> Restore snapshot in parallel
> ----------------------------
>
> Key: HBASE-21394
> URL: https://issues.apache.org/jira/browse/HBASE-21394
> Project: HBase
> Issue Type: Improvement
> Reporter: Zheng Hu
> Assignee: Zheng Hu
> Priority: Major
> Fix For: 3.0.0, 2.2.0, 2.1.2
>
>
> Our MapReduce/Spark job is highly dependent on SnapshotScanner. When restoring
> a big table for SnapshotScanner, it takes hours.
> Restoring the snapshot in parallel will help a lot.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)