On Wed, Aug 29, 2012 at 9:38 PM, Mohit Anchlia <mohitanch...@gmail.com> wrote:
> On Wed, Aug 29, 2012 at 9:19 PM, Stack <st...@duboce.net> wrote:
>
>>  On Wed, Aug 29, 2012 at 3:56 PM, Mohit Anchlia <mohitanch...@gmail.com>
>> wrote:
>> > If I use md5 hash + timestamp rowkey would hbase automatically detect the
>> > difference in ranges and peforms split? How does split work in such cases
>> > or is it still advisable to manually split the regions.
>>
>
> What logic would you recommend to split the table into multiple regions
> when using md5 hash?
>

Its hard to know how well your inserts will spread over the md5
namespace ahead of time.  You could try sampling or just let HBase
take care of the splits for you (Is there a problem w/ your letting
HBase do the splits?)

St.Ack

Reply via email to