[
https://issues.apache.org/jira/browse/HBASE-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Karthik Palanisamy updated HBASE-23095:
---------------------------------------
Affects Version/s: 1.1.2
1.2.1
2.0.0
> Reuse FileStatus in StoreFileInfo
> ---------------------------------
>
> Key: HBASE-23095
> URL: https://issues.apache.org/jira/browse/HBASE-23095
> Project: HBase
> Issue Type: Improvement
> Components: mob, snapshots
> Affects Versions: 1.1.2, 1.2.1, 2.0.0, 2.2.1
> Reporter: Karthik Palanisamy
> Assignee: Karthik Palanisamy
> Priority: Major
> Labels: performance
> Fix For: 3.0.0
>
> Attachments: PerformanceComparision.pdf
>
>
> The performance of create snapshot on large MOB table reasonably slow because
> there are two unnecessary calls to namenode on each Hfile, this while we
> create snapshot manifest. The first namenode call for getting StoreFile
> modification time
> [link|[https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFileInfo.java#L139]]
> which used for metrics and another namenode call for getting StoreFile size
> [link|[https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/SnapshotManifestV2.java#L132]]
> which used in snapshot manifest. Both calls can be avoided and this info can
> be fetched from existing FileStatus
> [link|[https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFileInfo.java#L155]].
>
> PFA. 2x performance is seen after reusing existing FileStatus.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)