[ 
https://issues.apache.org/jira/browse/HBASE-5489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13220327#comment-13220327
 ] 

[email protected] commented on HBASE-5489:
------------------------------------------------------



bq.  On 2012-03-01 06:44:36, Lars Hofhansl wrote:
bq.  > Looks good to me.
bq.  > Curious: Do you have a specific usecase in mind for this API?
bq.  
bq.  David Wang wrote:
bq.      Yes, I would like to not have to be forced to scan .META. everytime my 
client just wants the regions for a particular range, and that information is 
already cached in the client.  This is also more convenient for the caller than 
having to parse through all of the start/end keys in the table everytime.
bq.  
bq.  Lars Hofhansl wrote:
bq.      Wait. TableInputFormat is already configured with a Scan object, which 
do exactly the same thing (via a scanner).
bq.      You don't special InputFormat for this.

Sorry, that last was in response to your email where you say that you want to 
"make a TableInputFormat equivalent that only scans a sub-range of the table"


- Lars


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/4117/#review5490
-----------------------------------------------------------


On 2012-03-01 18:24:18, David Wang wrote:
bq.  
bq.  -----------------------------------------------------------
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/4117/
bq.  -----------------------------------------------------------
bq.  
bq.  (Updated 2012-03-01 18:24:18)
bq.  
bq.  
bq.  Review request for hbase.
bq.  
bq.  
bq.  Summary
bq.  -------
bq.  
bq.  getRegionsInRange() will retrieve the HRegionLocations for the regions 
associated with the specified key range, using client-side cache if possible.
bq.  
bq.  I have one question: right now the endKey specified to getRegionsInRange() 
is treated as inclusive.  I followed the behavior that I saw in 
HRegionInfo.containsRange().  However, other HBase code such as Scan treats the 
endKey as exclusive.  So I am not clear as to which way we should go here.  I 
can easily change the patch if we want the endKey to be exclusive; please let 
me know.  Thanks in advance.
bq.  
bq.  
bq.  This addresses bug HBASE-5489.
bq.      https://issues.apache.org/jira/browse/HBASE-5489
bq.  
bq.  
bq.  Diffs
bq.  -----
bq.  
bq.    src/main/java/org/apache/hadoop/hbase/client/HTable.java 29b8004 
bq.    src/test/java/org/apache/hadoop/hbase/client/TestFromClientSide.java 
bdeaefe 
bq.  
bq.  Diff: https://reviews.apache.org/r/4117/diff
bq.  
bq.  
bq.  Testing
bq.  -------
bq.  
bq.  Ran the TestFromClientSide unit tests and passed repeatedly.
bq.  
bq.  Ran test-patch.sh with the following results:
bq.  
bq.  -1 overall.  
bq.  
bq.      +1 @author.  The patch does not contain any @author tags.
bq.  
bq.      +1 tests included.  The patch appears to include 3 new or modified 
tests.
bq.  
bq.      -1 javadoc.  The javadoc tool appears to have generated -129 warning 
messages.
bq.  
bq.      +1 javac.  The applied patch does not increase the total number of 
javac compiler warnings.
bq.  
bq.      +1 findbugs.  The patch does not introduce any new Findbugs (version ) 
warnings.
bq.  
bq.      +1 release audit.  The applied patch does not increase the total 
number of release audit warnings.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  David
bq.  
bq.


                
> Add HTable accessor to get regions for a key range
> --------------------------------------------------
>
>                 Key: HBASE-5489
>                 URL: https://issues.apache.org/jira/browse/HBASE-5489
>             Project: HBase
>          Issue Type: Improvement
>          Components: client
>            Reporter: David S. Wang
>            Assignee: David S. Wang
>            Priority: Minor
>             Fix For: 0.92.1, 0.94.0
>
>         Attachments: HBASE-5489-2.patch, HBASE-5489-3-0.92.1.patch, 
> HBASE-5489-3.patch
>
>
> It would be nice to have an accessor to find all regions that overlap with a 
> particular range of keys. Right now, the only way to accomplish that is to 
> call HTable.getStartEndKeys(), then follow that with calls to 
> getRegionLocation() for the range of keys you are interested in.  This 
> algorithm has 2 drawbacks:
> * It returns more keys than is necessary most of the time.  This is 
> especially evident if there are a lot of regions comprising the table and the 
> range of keys is small.
> * It always does a scan of .META. via MetaScannerVisitor for at least 
> HTable.getStartEndKeys(), and perhaps for HRegionLocations that are not 
> already cached by the client.
> An accessor that limited its scans to a specified range could avoid scanning 
> .META. at all if the HRegionLocations being fetched were already cached by 
> the client, thereby potentially making this operation faster in common cases.
> Here's a proposal for the accessor:
>   /**
>    * Get the corresponding regions for an arbitrary range of keys.
>    * <p>
>    * @param startRow Starting row in range, inclusive
>    * @param endRow Ending row in range, inclusive
>    * @return A list of HRegionLocations corresponding to the regions that
>    * contain the specified range
>    * @throws IOException if a remote or network exception occurs
>    */
>   public List<HRegionLocation> getRegionsInRange(final byte [] startKey,
>     final byte [] endKey) throws IOException

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to