[ 
https://issues.apache.org/jira/browse/HBASE-5032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Adela Maznikar resolved HBASE-5032.
-----------------------------------

       Resolution: Fixed
    Fix Version/s: 0.89-fb
    
> Add other DELETE type information into the delete bloom filter to optimize 
> the time range query
> -----------------------------------------------------------------------------------------------
>
>                 Key: HBASE-5032
>                 URL: https://issues.apache.org/jira/browse/HBASE-5032
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Liyin Tang
>            Assignee: Adela Maznikar
>             Fix For: 0.89-fb
>
>
> To speed up time range scans we need to seek to the maximum timestamp of the 
> requested range,instead of going to the first KV of the (row, column) pair 
> and iterating from there. If we don't know the (row, column), e.g. if it is 
> not specified in the query, we need to go to end of the current row/column 
> pair first, get a KV from there, and do another seek to (row', column', 
> timerange_max) from there. We can only skip over to the timerange_max 
> timestamp when we know that there are no DeleteColumn records at the top of 
> that row/column with a higher timestamp. We can utilize another Bloom filter 
> keyed on (row, column) to quickly find that out. (From HBASE-4962)
> So the motivation is to save seek ops for scanning time-range queries if we 
> know there is no delete for this row/column. 
> From the implementation perspective, we have already had a delete family 
> bloom filter which contains all the delete family key values. So we can reuse 
> the same bloom filter for all other kinds of delete information such as 
> delete columns or delete. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to