[ 
https://issues.apache.org/jira/browse/HBASE-8362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13649001#comment-13649001
 ] 

Lars Hofhansl commented on HBASE-8362:
--------------------------------------

It's on my (long'ish) list of things to do.
It is not 100% clear that we would see a large improvement from this.

If the Gets in a batch a far apart (in terms of blocks) reseek won't buy much; 
if on the other hand there is a high probability that the Gets end up on the 
same block we'd expect to see an improvement.

I am not saying it should not be done, just that I do not think this should be 
generally enabled, but be at the discretion of the caller. I.e. a new API.

If ROW bloomfilters would work with scans this might be a different story. 
Another consideration could be a new scanner API (something like reseekExact or 
something), which would attempt to reseek to an exact position and indicate 
whether than succeeded or not.

                
> Possible MultiGet optimization
> ------------------------------
>
>                 Key: HBASE-8362
>                 URL: https://issues.apache.org/jira/browse/HBASE-8362
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Lars Hofhansl
>
> Currently MultiGets are executed on a RegionServer in a single thread in a 
> loop that handles each Get separately (opening a scanner, seeking, etc).
> It seems we could optimize this (per region at least) by opening a single 
> scanner and issue a reseek for each Get that was requested.
> I have not tested this yet and no patch, but I would like to solicit feedback 
> on this idea.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to