[
https://issues.apache.org/jira/browse/HBASE-28538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Beaudreault updated HBASE-28538:
--------------------------------------
Attachment: async-prof-pid-7-cpu-2.html
> BackupHFileCleaner.loadHFileRefs is very expensive
> --------------------------------------------------
>
> Key: HBASE-28538
> URL: https://issues.apache.org/jira/browse/HBASE-28538
> Project: HBase
> Issue Type: Bug
> Components: backup&restore
> Reporter: Bryan Beaudreault
> Priority: Major
> Attachments: async-prof-pid-7-cpu-2.html
>
>
> I noticed some odd CPU spikes on the hmasters of one of our clusters. Turns
> out it had been getting lots of bulkoads (30k) and processing them was
> expensive. The method scans hbase and then parses the paths. Surprisingly the
> parsing is more expensive than the reading hbase, with the vast majority of
> time spent in org/apache/hadoop/fs/Path.<init>.
> We should see if this is possible to be optimized. Attaching profile.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)