On Mon, Oct 24, 2011 at 1:27 AM, Sam Seigal <[email protected]> wrote:
> According to the HBase book, pre-splitting tables and doing manual
> splits is a better long-term strategy than letting HBase handle it.
>

It's good for getting a table off the ground, yes.

> Since I do not know what the keys from the prod system are going to
> look like, I am adding a machine number prefix to the row keys
> and pre-splitting the tables based on the prefix (prefix 0 goes to
> machine A, prefix 1 goes to machine B, etc.).
>

You don't need to do an in-order scan of the data? What's the rest of your row key look like?

> Once I decide to add more machines, I can always do a rolling split
> and add more prefixes.
>

Yes.

> Is this a good strategy for pre-splitting the tables?
>

So, you'll start out with one region per server? What do you think the rate of splitting will be like? Are you using the default region size or have you bumped this up?

St.Ack
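
For reference, here is a minimal sketch of the prefix scheme described in the quoted text, in plain Python with no HBase client involved. The function names, the two-digit zero-padded prefix, and the "-" separator are all assumptions made for illustration; nothing in the thread specifies the key layout. It only shows how one split point per prefix boundary yields one region per prefix, and why a fixed-width prefix matters for byte-wise ordering.

```python
import bisect


def split_keys(num_prefixes, width=2):
    """Boundary keys between consecutive prefixes (hypothetical layout).

    N prefixes need N - 1 split points. Zero-padding keeps byte-wise
    order equal to numeric order (b"02" < b"10"), which holds only
    while num_prefixes <= 10 ** width.
    """
    return [str(i).zfill(width).encode("ascii") for i in range(1, num_prefixes)]


def salted_key(prefix, row_key, width=2):
    """Prepend the zero-padded machine prefix to the original row key."""
    return str(prefix).zfill(width).encode("ascii") + b"-" + row_key


def region_for(key, splits):
    """Index of the region whose [start, end) key range contains `key`."""
    return bisect.bisect_right(splits, key)


if __name__ == "__main__":
    splits = split_keys(4)
    for prefix in range(4):
        key = salted_key(prefix, b"some-row")
        print(key, "-> region", region_for(key, splits))
```

Note that with this layout a scan in original key order has to fan out across every prefix, which is the concern behind the in-order scan question above.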
