[ 
https://issues.apache.org/jira/browse/HBASE-11811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14108464#comment-14108464
 ] 

Lars Hofhansl commented on HBASE-11811:
---------------------------------------

Yeah, PE or YCSB is the next step. I expect with the current patch scanning 
would be slower as we would build up the index every time we switch to a new 
block. Might need a flag to updateCurrentBlock to indicate whether we want to 
load the index or so.
I think the ondemand build of the index would be best (probably need to bring 
back the current seek code again in that case).

I suppose I could make a separate patch just removing code duplication between 
ReaderV2 and V3.


> Use binary search for seeking into a block
> ------------------------------------------
>
>                 Key: HBASE-11811
>                 URL: https://issues.apache.org/jira/browse/HBASE-11811
>             Project: HBase
>          Issue Type: Brainstorming
>            Reporter: Lars Hofhansl
>         Attachments: 11811-wip-v2.txt, block_index-v2.txt
>
>
> Currently upon every seek (including Gets) we need to linearly look through 
> the block from the beginning until we find the Cell we are looking for.
> It should be possible to build a simple cache of offsets of Cells for each 
> block as it is loaded and then use binary search to find the Cell in question.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to