Thanks Dave for your suggestions! Will let you know if I find some approach to tackle this situation.
Regards On Mon, Nov 28, 2016 at 9:05 PM, Dave Latham <lat...@davelink.net> wrote: > If you truly have no way to predict anything about the distribution of your > data across the row key space, then you are correct that there is no way to > presplit your regions in an effective way. Either you need to make some > starting guess, such as a small number of uniform splits, or wait until you > have some information about what the data will look like. > > Dave > > On Mon, Nov 28, 2016 at 12:42 AM, Sachin Jain <sachinjain...@gmail.com> > wrote: > > > Hi, > > > > I was going though pre-splitting a table article [0] and it is mentioned > > that it is generally best practice to presplit your table. But don't we > > need to know the data in advance in order to presplit it. > > > > Question: What should be the best practice when we don't know what data > is > > going to be inserted into HBase. Essentially I don't know the key range > so > > if I specify wrong splits, then either first or last split can be a hot > > region in my system. > > > > [0]: https://hbase.apache.org/book.html#rowkey.regionsplits > > > > Thanks > > -Sachin > > >