Thanks! Few more doubts 1.When I don't supply SPLITS at table creation , all put operation will go to one region only. But when region grows more than hbase.hregion.max.filesize , then 2 regions will be created both have half-half data or another will be empty initially? 2.If both have 50-50% data and row key is monotonically increasing then 1 region will be half filled always and will never be filled again ? 3.While prespliting table only way is to specify row boundaries and key prefixes ?Say if i don't know key ranges , as in my case its GUID hexadecimal 32 character string , what should be region split boundary ? and How many splits should be created - is it equal to no of regionserver aka datanodes ? 4.For keys of type ACTIVITYTYPE-DATE (where activity type has 2 values 1.login 2.logout) what should be split strategy ?
On Tue, Jul 15, 2014 at 7:03 PM, Ted Yu <[email protected]> wrote: > Shushant: > For #2, if table has only one region, the hosting region server would > receive all writes. > For #4, yes - presplitting goes with fixed number of regions. > > Cheers > > > On Tue, Jul 15, 2014 at 6:23 AM, sudhakara st <[email protected]> > wrote: > > > You can find info here > > http://hbase.apache.org/book/rowkey.design.html#rowkey.regionsplits > > http://hortonworks.com/blog/apache-hbase-region-splitting-and-merging/ > > > > > > On Tue, Jul 15, 2014 at 6:40 PM, Shushant Arora < > [email protected] > > > > > wrote: > > > > > 1.How to split region servers at table definition time? > > > > > > 2.Will hbase write onto only one region server when no splits are > defined > > > even if key is not monotonically increasing? > > > > > > 3. When does a region split occurs. > > > > > > 4. Will no of regions be fixed when hbase table is presplitted at table > > > creation time. > > > > > > > > > > > -- > > > > Regards, > > ...sudhakara > > >
