[ 
https://issues.apache.org/jira/browse/HBASE-8001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13596868#comment-13596868
 ] 

Lars Hofhansl commented on HBASE-8001:
--------------------------------------

In my scenario I cannot measure an improvement:
* all data in the blockcache
* 40m small KVs (8 byte keys, 20 byte values) across two CFs
* scan + filter, where the filter filters out everything at the server
* column family with a single column
* VERSIONS=1
* table is fully compacted

Tests:
* adding a single family to the Scan object: 11.8
* adding the family+column to the Scan object: 13.1

I get the same numbers with or without the patch. The 2nd number should have 
improved.

                
> Avoid unnecessary lazy seek
> ---------------------------
>
>                 Key: HBASE-8001
>                 URL: https://issues.apache.org/jira/browse/HBASE-8001
>             Project: HBase
>          Issue Type: Improvement
>          Components: regionserver
>    Affects Versions: 0.94.5
>            Reporter: Raymond Liu
>            Assignee: Raymond Liu
>             Fix For: 0.98.0
>
>         Attachments: HBASE-8001_onescanner.patch
>
>
> Lazy seek helps reduce the number of real seeks needed across multiple HFiles 
> when the KV from a newer HFile is enough to satisfy the query.
> In many cases, however, it just postpones the real seek and does not reduce the 
> number of real seeks, e.g. when there is only one HFile, when the other 
> StoreFileScanners have been closed and only one is left, when the scan needs 
> to go through all the versions, or when there is only one version of each row 
> and a sequential scan is performed. In these cases, lazy seek just adds extra 
> overhead.
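
The cases in the description can be illustrated with a toy model. The sketch below is not HBase code: ToyFile, eagerRealSeeks and lazyRealSeeks are hypothetical names, and it assumes that only one version per key is wanted and that files are probed newest first. It counts real seeks only, so it shows when lazy seek cannot lower that count, not the bookkeeping overhead itself.

```java
import java.util.Arrays;
import java.util.List;
import java.util.TreeSet;

// Toy model only, not HBase code. Files are listed newest first.
class LazySeekSketch {

    /** A pretend store file: just the sorted set of keys it holds. */
    static final class ToyFile {
        final TreeSet<String> keys = new TreeSet<>();

        ToyFile(String... ks) {
            keys.addAll(Arrays.asList(ks));
        }
    }

    /** Eager seeking: every file does a real seek for every target key. */
    static int eagerRealSeeks(List<ToyFile> newestFirst, List<String> targets) {
        return newestFirst.size() * targets.size();
    }

    /**
     * Lazy seeking, simplified (assumption): files are probed newest first,
     * and the real seek on older files is skipped as soon as a newer file
     * holds the target key.
     */
    static int lazyRealSeeks(List<ToyFile> newestFirst, List<String> targets) {
        int realSeeks = 0;
        for (String target : targets) {
            for (ToyFile file : newestFirst) {
                realSeeks++; // a real seek into this file
                if (file.keys.contains(target)) {
                    break;   // a newer file satisfied the query
                }
            }
        }
        return realSeeks;
    }

    public static void main(String[] args) {
        List<String> targets = Arrays.asList("row1", "row2", "row3");

        // Single file: lazy seek has nothing to skip.
        List<ToyFile> oneFile = Arrays.asList(new ToyFile("row1", "row2", "row3"));

        // Three files, the newest one holds every target key.
        List<ToyFile> threeFiles = Arrays.asList(
                new ToyFile("row1", "row2", "row3"),
                new ToyFile("row1", "row2"),
                new ToyFile("row3"));

        System.out.println("one file:    eager=" + eagerRealSeeks(oneFile, targets)
                + " lazy=" + lazyRealSeeks(oneFile, targets));
        System.out.println("three files: eager=" + eagerRealSeeks(threeFiles, targets)
                + " lazy=" + lazyRealSeeks(threeFiles, targets));
    }
}
```

With a single file both counts are 3, so any lazy-seek bookkeeping would be pure overhead in this model. With three files, where the newest holds every key, the eager count is 9 and the lazy count is 3.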

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
