[
https://issues.apache.org/jira/browse/HBASE-5898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13271901#comment-13271901
]
Todd Lipcon commented on HBASE-5898:
------------------------------------
Shouldn't be hard to get that CPU time back -- we can just add an array of
cacheline-padded AtomicLongs to the cache. Whenever we add something to the
cache, we do {{changeCounters[key.hashCode() %
changeCounters.length].getAndIncrement()}}. Then change the code to:
{code}
AtomicLong changeCounter = changeCounters[key.hashCode() %
changeCounters.length];
long firstTimeChangeCounter = changeCounter.get();
first time:
try to look up in cache
if found: return it
second time:
take lock:
if changeCounter.get() == firstTimeChangeCounter: it's not in cache
otherwise: look up in cache again
{code}
We'd probably want to cache-pad the AtomicLongs too to avoid false sharing.
> Consider double-checked locking for block cache lock
> ----------------------------------------------------
>
> Key: HBASE-5898
> URL: https://issues.apache.org/jira/browse/HBASE-5898
> Project: HBase
> Issue Type: Improvement
> Components: performance
> Affects Versions: 0.94.1
> Reporter: Todd Lipcon
> Assignee: Todd Lipcon
> Priority: Critical
> Attachments: 5898-TestBlocksRead.txt, hbase-5898.txt
>
>
> Running a workload with a high query rate against a dataset that fits in
> cache, I saw a lot of CPU being used in IdLock.getLockEntry, being called by
> HFileReaderV2.readBlock. Even though it was all cache hits, it was wasting a
> lot of CPU doing lock management here. I wrote a quick patch to switch to a
> double-checked locking and it improved throughput substantially for this
> workload.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira