[ 
https://issues.apache.org/jira/browse/HBASE-9915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13816350#comment-13816350
 ] 

Lars Hofhansl commented on HBASE-9915:
--------------------------------------

[~jmspaggi] Yeah, did some count queries in Phoenix. Without patch they took 
27s with the patch they take 14s. You'll see the improvement if (1) you use a 
block encoder (like FAST_DIFF, etc) and (2) you added some columns to you scan 
object (so that the ExplicitColumnTracker is used under the hood). I am not 
sure any the the performance evaluation tests do that (and if not, we should 
probably add that).

Making a trunk patch now for a full test run.

> Severe performance bug: isSeeked() in EncodedScannerV2 is always false
> ----------------------------------------------------------------------
>
>                 Key: HBASE-9915
>                 URL: https://issues.apache.org/jira/browse/HBASE-9915
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Lars Hofhansl
>            Assignee: Lars Hofhansl
>             Fix For: 0.98.0, 0.96.1, 0.94.14
>
>         Attachments: 9915-0.94-v2.txt, 9915-0.94.txt, profile.png
>
>
> While debugging why reseek is so slow I found that it is quite broken for 
> encoded scanners.
> The problem is this:
> AbstractScannerV2.reseekTo(...) calls isSeeked() to check whether scanner was 
> seeked or not. If it was it checks whether the KV we want to seek to is in 
> the current block, if not it always consults the index blocks again.
> isSeeked checks the blockBuffer member, which is not used by EncodedScannerV2 
> and thus always returns false, which in turns causes an index lookup for each 
> reseek.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to