[
https://issues.apache.org/jira/browse/HBASE-9679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948775#comment-13948775
]
Lars Hofhansl commented on HBASE-9679:
--------------------------------------
I was just thinking about this. Forgot we have this issue.
We could scan through an HFileBlock as we bring it into the cache and build an
index with the offsets of each KV. Then we can binary search based on those
offsets.
> Binary search in HFile block
> ----------------------------
>
> Key: HBASE-9679
> URL: https://issues.apache.org/jira/browse/HBASE-9679
> Project: HBase
> Issue Type: Improvement
> Components: HFile
> Affects Versions: 0.95.2, 0.94.12
> Reporter: Liang Xie
> Assignee: Liang Xie
> Priority: Minor
>
> It's not a top priority issue, seems to me.
> Right now hbase do a linear scan to search a key within a hfile block on
> interst, in special case, e.g. 100% read scenario or high read/write ratio
> scanario, it's useful to do a binary search improvement to reduce the CPU
> cost and response time, i think the biggest benefit should be the cpu:)
--
This message was sent by Atlassian JIRA
(v6.2#6252)