[ 
https://issues.apache.org/jira/browse/HBASE-13761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14557962#comment-14557962
 ] 

Vladimir Rodionov commented on HBASE-13761:
-------------------------------------------

Optimizations in this patch:

* new implementation of satisfies method using Unsafe access (8 bytes at a time)
* When there are more than one fuzzy keys, significant improvement in handling 
in getNextCellHint
* When there are more than one fuzzy keys, we keep track of a last matched 
fuzzy key and try it next time first.

Performance:

YMMV, but in my tests (RegionScanner - not ResultScanner) I observed numbers 
between 7-10% (for single fuzzy key) up to 100% (for 20 fuzzy keys). The more 
fuzzy keys in a filter - the more performance gain is. 

This filter runs at the same speed regardless of a number of fuzzy search keys: 
1, 20, 100 ... 

> Optimize FuzzyRowFilter
> -----------------------
>
>                 Key: HBASE-13761
>                 URL: https://issues.apache.org/jira/browse/HBASE-13761
>             Project: HBase
>          Issue Type: Improvement
>          Components: Filters
>    Affects Versions: 2.0.0, 1.1.0, 0.98.13
>            Reporter: Vladimir Rodionov
>            Assignee: Vladimir Rodionov
>            Priority: Minor
>             Fix For: 2.0.0, 0.98.14, 1.1.1
>
>         Attachments: HBASE-13761.patch
>
>
> FuzzyRowFilter has some room for improvements: a lot of byte-by-byte 
> arithmetic, non-efficient algorithm of selecting next candidate row etc. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to