On Wed, Aug 29, 2012 at 10:50 PM, Stack <[email protected]> wrote:

> On Wed, Aug 29, 2012 at 9:38 PM, Mohit Anchlia <[email protected]> wrote:
> > On Wed, Aug 29, 2012 at 9:19 PM, Stack <[email protected]> wrote:
> >
> >> On Wed, Aug 29, 2012 at 3:56 PM, Mohit Anchlia <[email protected]> wrote:
> >> > If I use an md5 hash + timestamp rowkey, would HBase automatically detect
> >> > the difference in ranges and perform splits? How does splitting work in
> >> > such cases, or is it still advisable to manually split the regions?
> >>
> >
> > What logic would you recommend to split the table into multiple regions
> > when using md5 hash?
> >
>
> It's hard to know how well your inserts will spread over the md5
> namespace ahead of time. You could try sampling or just let HBase
> take care of the splits for you (Is there a problem w/ your letting
> HBase do the splits?)
>

From what I've read, it's advisable to do manual splits since you are able to
spread the load in a more predictable way. If I am missing something, please
let me know.

> St.Ack
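
To make the manual-split option discussed above concrete, here is a minimal sketch of how evenly spaced split points over the md5 namespace could be computed. It assumes the row key is the raw 16-byte md5 digest followed by an 8-byte big-endian timestamp; the thread does not state the actual key layout, and the names `md5_split_points`, `row_key`, and `region_index` are hypothetical helpers, not part of HBase. The sketch only models which region a key would fall into, relying on the fact that HBase orders row keys as byte strings.

```python
import bisect
import hashlib


def md5_split_points(num_regions: int, prefix_bytes: int = 4) -> list[bytes]:
    """Return num_regions - 1 split keys spaced evenly over the byte space.

    Only a short prefix is needed: md5 output is close to uniform, so
    dividing the first few bytes evenly divides the whole key space.
    """
    if num_regions < 2:
        return []
    space = 256 ** prefix_bytes
    return [
        (i * space // num_regions).to_bytes(prefix_bytes, "big")
        for i in range(1, num_regions)
    ]


def row_key(entity_id: str, timestamp_ms: int) -> bytes:
    """Assumed layout: 16-byte md5 digest + 8-byte big-endian timestamp."""
    digest = hashlib.md5(entity_id.encode("utf-8")).digest()
    return digest + timestamp_ms.to_bytes(8, "big")


def region_index(key: bytes, splits: list[bytes]) -> int:
    """Index of the region holding key (a region's start key is inclusive)."""
    return bisect.bisect_right(splits, key)


if __name__ == "__main__":
    splits = md5_split_points(8)
    counts = [0] * 8
    for n in range(100_000):
        counts[region_index(row_key(f"user-{n}", 1346300000000), splits)] += 1
    print(counts)  # per-region row counts for the sample ids
```

Split keys computed this way could then be supplied when the table is created, for example through the HBase shell's `SPLITS` option on `create`. Whether this beats letting HBase split on its own depends on how the real ids hash, which is the sampling question Stack raises.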
