Thanks Nicolas for the clarification.  I had a follow-up query.

What will happen if we increased the region size, say from current value of 256 
MB to a new value of 2GB?
Will existing regions continue to use only 256 MB space?  

Is there a way to reorganize the regions so that each regions grows to 2GB size?

Thanks,
Srikanth

-----Original Message-----
From: Nicolas Spiegelberg [mailto:nspiegelb...@fb.com] 
Sent: Tuesday, November 22, 2011 10:59 PM
To: user@hbase.apache.org
Subject: Re: Region Splits

No.  The purpose of major compactions is to merge & dedupe within a region
boundary.  Compactions will not alter region boundaries, except in the
case of splits where a compaction is necessary to filter out any Rows from
the parent region that are no longer applicable to the daughter region.

On 11/22/11 9:04 AM, "Srikanth P. Shreenivas"
<srikanth_shreeni...@mindtree.com> wrote:

>Will major compactions take care of merging "older" regions or adding
>more key/values to them as number of regions grow?
>
>Regard,
>Srikanth
>
>-----Original Message-----
>From: Amandeep Khurana [mailto:ama...@gmail.com]
>Sent: Monday, November 21, 2011 7:25 AM
>To: user@hbase.apache.org
>Subject: Re: Region Splits
>
>Mark,
>
>Yes, your understanding is correct. If your keys are sequential
>(timestamps
>etc), you will always be writing to the end of the table and "older"
>regions will not get any writes. This is one of the arguments against
>using
>sequential keys.
>
>-ak
>
>On Sun, Nov 20, 2011 at 11:33 AM, Mark <static.void....@gmail.com> wrote:
>
>> Say we have a use case that has sequential row keys and we have rows
>> 0-100. Let's assume that 100 rows = the split size. Now when there is a
>> split it will split at the halfway mark so there will be two regions as
>> follows:
>>
>> Region1 [START-49]
>> Region2 [50-END]
>>
>> So now at this point all inserts will be writing to Region2 only
>>correct?
>> Now at some point Region2 will need to split and it will look like the
>> following before the split:
>>
>> Region1 [START-49]
>> Region2 [50-150]
>>
>> After the split it will look like:
>>
>> Region1 [START-49]
>> Region2 [50-100]
>> Region3 [150-END]
>>
>> And this pattern will continue correct? My question is when there is a
>>use
>> case that has sequential keys how would any of the older regions every
>> receive anymore writes? It seems like they would always be stuck at
>> MaxRegionSize/2. Can someone please confirm or clarify this issue?
>>
>> Thanks
>>
>>
>>
>>
>>
>
>________________________________
>
>http://www.mindtree.com/email/disclaimer.html

Reply via email to