On Mon, Oct 24, 2011 at 1:27 AM, Sam Seigal <[email protected]> wrote:
> According to the HBase book, pre-splitting tables and doing manual
> splits is a better long-term strategy than letting HBase handle it.
>

It's good for getting a table off the ground, yes.


> Since I do not know what the keys from the prod system are going to
> look like, I am adding a machine number prefix to the row keys
> and pre-splitting the tables based on the prefix (prefix 0 goes to
> machine A, prefix 1 goes to machine B, etc.).
>

You don't need to do an in-order scan of the data?  What does the rest
of your row key look like?
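
A minimal sketch of why the scan question matters, not taken from your
setup. It assumes the prefix is a single byte derived from a hash of the
original key; the names prefixed_key, split_points and scan_ranges are
made up for illustration, and your prefix assignment may work
differently. It only builds keys and split points and does not talk to
HBase.

```python
import hashlib

NUM_PREFIXES = 4  # hypothetical: one prefix per region server


def prefixed_key(row_key: str, num_prefixes: int = NUM_PREFIXES) -> bytes:
    """Prepend a one-byte prefix derived from a stable hash of the key."""
    raw = row_key.encode("utf-8")
    prefix = hashlib.md5(raw).digest()[0] % num_prefixes
    return bytes([prefix]) + raw


def split_points(num_prefixes: int = NUM_PREFIXES) -> list:
    """Split keys giving one region per prefix: 0x01, 0x02, ..."""
    return [bytes([p]) for p in range(1, num_prefixes)]


def scan_ranges(start: str, stop: str, num_prefixes: int = NUM_PREFIXES) -> list:
    """A scan over [start, stop) of the original keys needs one range per prefix."""
    lo, hi = start.encode("utf-8"), stop.encode("utf-8")
    return [(bytes([p]) + lo, bytes([p]) + hi) for p in range(num_prefixes)]
```

With prefixes in front, rows that were adjacent in the original key
order land in different regions, so a range scan has to run once per
prefix and the client has to merge the results if it wants them in order.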


> Once I decide to add more machines, I can always do a rolling split
> and add more prefixes.
>

Yes.

> Is this a good strategy for pre-splitting the tables?
>

So, you'll start out with one region per server?

What do you think the rate of splitting will be like?  Are you using
default region size or have you bumped this up?
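
For a rough feel of how much splitting to expect, here is a
back-of-the-envelope sketch. The data volume and region sizes in the
usage lines are made-up figures, min_regions is a hypothetical helper,
and it assumes writes spread evenly over the prefixes. Check
hbase.hregion.max.filesize in your own config for the real region size.

```python
import math


def min_regions(total_bytes: int, max_region_bytes: int, initial_regions: int) -> int:
    """Lower bound on the region count once total_bytes have been loaded.

    No region may stay above max_region_bytes, so at least
    ceil(total / max) regions are needed; a pre-split table never has
    fewer regions than it started with.
    """
    return max(initial_regions, math.ceil(total_bytes / max_region_bytes))


# Hypothetical figures: 1 TiB of data, table pre-split into 4 regions.
small = min_regions(2**40, 256 * 2**20, 4)   # 256 MiB regions
large = min_regions(2**40, 2**30, 4)         # 1 GiB regions
```

The gap between the starting region count and this bound is roughly the
number of splits the table has to go through, which is why the region
size setting matters for the question above.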

St.Ack
