On Wed, Aug 29, 2012 at 10:50 PM, Stack <[email protected]> wrote:

> On Wed, Aug 29, 2012 at 9:38 PM, Mohit Anchlia <[email protected]>
> wrote:
> > On Wed, Aug 29, 2012 at 9:19 PM, Stack <[email protected]> wrote:
> >
> >>  On Wed, Aug 29, 2012 at 3:56 PM, Mohit Anchlia <[email protected]
> >
> >> wrote:
> >> > If I use md5 hash + timestamp rowkey would hbase automatically detect
> the
> >> > difference in ranges and peforms split? How does split work in such
> cases
> >> > or is it still advisable to manually split the regions.
> >>
> >
> > What logic would you recommend to split the table into multiple regions
> > when using md5 hash?
> >
>
> Its hard to know how well your inserts will spread over the md5
> namespace ahead of time.  You could try sampling or just let HBase
> take care of the splits for you (Is there a problem w/ your letting
> HBase do the splits?)
>
> From what I;ve read it's advisable to do manual splits since you are able
to spread the load in more predictable way. If I am missing something
please let me know.


> St.Ack
>

Reply via email to