[ 
https://issues.apache.org/jira/browse/HBASE-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karthik Palanisamy updated HBASE-23095:
---------------------------------------
    Affects Version/s: 1.1.2
                       1.2.1
                       2.0.0

> Reuse FileStatus in StoreFileInfo
> ---------------------------------
>
>                 Key: HBASE-23095
>                 URL: https://issues.apache.org/jira/browse/HBASE-23095
>             Project: HBase
>          Issue Type: Improvement
>          Components: mob, snapshots
>    Affects Versions: 1.1.2, 1.2.1, 2.0.0, 2.2.1
>            Reporter: Karthik Palanisamy
>            Assignee: Karthik Palanisamy
>            Priority: Major
>              Labels: performance
>             Fix For: 3.0.0
>
>         Attachments: PerformanceComparision.pdf
>
>
> Creating a snapshot of a large MOB table is slow because building the 
> snapshot manifest makes two unnecessary namenode calls for each HFile. The 
> first call fetches the StoreFile modification time 
> [link|[https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFileInfo.java#L139]],
>  which is used for metrics. The second call fetches the StoreFile size 
> [link|[https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/SnapshotManifestV2.java#L132]],
>  which is used in the snapshot manifest. Both calls can be avoided because 
> this information is already available from the existing FileStatus 
> [link|[https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFileInfo.java#L155]].
>  
> Please find the comparison attached. A 2x speedup is seen after reusing the 
> existing FileStatus.
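
The idea described above can be illustrated with a minimal, self-contained
sketch. It does not use the real HBase or Hadoop classes: FileMeta, CountingFs
and StoreFileRef are hypothetical stand-ins for FileStatus, the filesystem
client and StoreFileInfo, and the call counter only simulates namenode round
trips. It is not the actual patch, just a model of why reusing the status
fetched once removes the two extra lookups per file.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for FileStatus: metadata captured by a single lookup. */
final class FileMeta {
    final long length;
    final long modificationTime;

    FileMeta(long length, long modificationTime) {
        this.length = length;
        this.modificationTime = modificationTime;
    }
}

/** Hypothetical filesystem client that counts simulated namenode round trips. */
final class CountingFs {
    private final Map<String, FileMeta> files = new HashMap<>();
    int rpcCalls = 0;

    void addFile(String path, long length, long modificationTime) {
        files.put(path, new FileMeta(length, modificationTime));
    }

    /** Every call here models one remote metadata request. */
    FileMeta getFileStatus(String path) {
        rpcCalls++;
        return files.get(path);
    }
}

/** Hypothetical analogue of StoreFileInfo that keeps the status it was built from. */
final class StoreFileRef {
    private final CountingFs fs;
    private final String path;
    private final FileMeta cached;

    StoreFileRef(CountingFs fs, String path) {
        this.fs = fs;
        this.path = path;
        this.cached = fs.getFileStatus(path); // the one lookup that is already paid for
    }

    // "Before" style: each accessor goes back to the filesystem.
    long modificationTimeViaFs() { return fs.getFileStatus(path).modificationTime; }
    long sizeViaFs() { return fs.getFileStatus(path).length; }

    // "After" style: both values come from the cached status, with no extra calls.
    long modificationTime() { return cached.modificationTime; }
    long size() { return cached.length; }
}

final class FileStatusReuseDemo {
    public static void main(String[] args) {
        CountingFs fs = new CountingFs();
        fs.addFile("/example/store/file1", 1024L, 42L); // made-up path and values
        StoreFileRef ref = new StoreFileRef(fs, "/example/store/file1");

        int afterConstruction = fs.rpcCalls;
        ref.modificationTimeViaFs();
        ref.sizeViaFs();
        System.out.println("extra calls without reuse: " + (fs.rpcCalls - afterConstruction));

        int beforeCached = fs.rpcCalls;
        ref.modificationTime();
        ref.size();
        System.out.println("extra calls with reuse: " + (fs.rpcCalls - beforeCached));
    }
}
```

In this model, the savings scale with the number of files in the snapshot,
which is why the effect would be most visible on large MOB tables. The real
speedup depends on namenode latency and is reported in the attachment, not by
this sketch.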



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
