[jira] [Commented] (HBASE-18806) VerifyRep by snapshot need not to restore snapshot for each mapper

Zheng Hu (JIRA) Tue, 02 Jan 2018 20:04:05 -0800

    [ 
https://issues.apache.org/jira/browse/HBASE-18806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16309121#comment-16309121
 ]


Zheng Hu commented on HBASE-18806:
----------------------------------

I have now ported this patch to our cluster. In our online cluster, I have a 
table with 512 regions and 4296 HFiles. It took about 36 min in total to 
restore the source snapshot and the peer snapshot; after that, the verify 
replication job starts scanning rows by reading the snapshots. 

However, without this patch, it took about 17 min to restore the source 
snapshot, and after the job was submitted, each mapper took about 20 min to 
restore the peer snapshot (so many redundant RPCs to the HDFS cluster); only 
after that did verify replication start to read rows. 

[~tedyu], [~zghaobac],  FYI  

I plan to push these VerifyReplication patches into master & branch-2 before 
2.0.0-beta-2. Other related issues are HBASE-18452 and HBASE-18484. 

Thanks. 

> VerifyRep by snapshot need not to restore snapshot for each mapper
> ------------------------------------------------------------------
>
>                 Key: HBASE-18806
>                 URL: https://issues.apache.org/jira/browse/HBASE-18806
>             Project: HBase
>          Issue Type: Improvement
>          Components: Replication
>    Affects Versions: 2.0.0-alpha-2
>            Reporter: Zheng Hu
>            Assignee: Zheng Hu
>         Attachments: HBASE-18806.v1.patch, HBASE-18806.v2.patch, 
> HBASE-18806.v3.patch, HBASE-18806.v3.patch
>
>
> In the following call stack, it seems that each mapper task restores the 
> snapshot. If we verify replication with a snapshot that has many HFiles, 
> restoring the snapshot takes a long time. In our cluster, restoring the 
> snapshot took ~30 min when verifying a big table.
> {code}
> Verifier.map
>     |--------> replicatedScanner = new TableSnapshotScanner(...)
>                    |--------> TableSnapshotScanner.init()
>                                   |-----> RestoreSnapshotHelper.copySnapshotForScanner
> {code} 
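
The call stack above implies one restore per mapper. As a rough illustration 
only, the toy model below counts restores under the two strategies: every 
mapper restoring the peer snapshot itself, versus restoring once before the 
job is submitted and sharing the restored directory. It does not use any HBase 
API; FakeRestorer, the directory path, the snapshot name, and the 
one-mapper-per-region assumption are all hypothetical stand-ins and are not 
taken from the patch.

```java
import java.util.concurrent.atomic.AtomicInteger;

/**
 * Toy model only: counts how many snapshot restores happen under each
 * strategy. None of these names are HBase APIs.
 */
public class RestoreOnceSketch {

    /** Stand-in for the expensive restore step; it only counts invocations. */
    static final class FakeRestorer {
        private final AtomicInteger restores = new AtomicInteger();

        String restore(String snapshotName) {
            restores.incrementAndGet();
            // Hypothetical restore directory, for illustration only.
            return "/tmp/restored/" + snapshotName;
        }

        int restoreCount() {
            return restores.get();
        }
    }

    /** Every mapper restores the peer snapshot itself, as in the call stack. */
    static int restoresWhenEachMapperRestores(int mappers) {
        FakeRestorer restorer = new FakeRestorer();
        for (int i = 0; i < mappers; i++) {
            String dir = restorer.restore("peerSnapshot");
            scan(dir);
        }
        return restorer.restoreCount();
    }

    /** The restore happens once up front; mappers only read the result. */
    static int restoresWhenRestoredOnce(int mappers) {
        FakeRestorer restorer = new FakeRestorer();
        String dir = restorer.restore("peerSnapshot");
        for (int i = 0; i < mappers; i++) {
            scan(dir);
        }
        return restorer.restoreCount();
    }

    /** Placeholder for the row scan over an already restored snapshot. */
    static void scan(String restoredDir) {
        // Intentionally empty: only the number of restores matters here.
    }

    public static void main(String[] args) {
        int mappers = 512; // assumption: one mapper per region
        System.out.println("per-mapper restores: "
                + restoresWhenEachMapperRestores(mappers));
        System.out.println("restore-once restores: "
                + restoresWhenRestoredOnce(mappers));
    }
}
```

In this model the number of restores grows linearly with the number of mappers 
in the first strategy and stays at one in the second, which matches the 
"redundant RPCs" observation in the comment.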



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
