A region splits when it grows larger than the configured maximum region size (hbase.hregion.max.filesize). Your data is small enough to fit in a single region.
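As a rough sanity check on that sizing, here is a minimal Python sketch, not HBase code. It assumes a 256 MB split threshold (check hbase.hregion.max.filesize in your own hbase-site.xml). It counts only key and value bytes, with the row key stored once per cell. It ignores column family, qualifier, and timestamp overhead as well as compression. The function names are hypothetical. The second half is a toy model of a split over sorted keys. It cuts by row count, which is a simplification of how a real split point is chosen.

```python
import os

# Assumed split threshold for illustration only; the real value comes from
# hbase.hregion.max.filesize in your cluster configuration.
MAX_REGION_BYTES = 256 * 1024 * 1024


def estimated_table_bytes(rows, key_bytes, value_bytes, columns):
    """Rough lower bound: each cell carries its own copy of the row key."""
    return rows * columns * (key_bytes + value_bytes)


def split_region(sorted_keys):
    """Toy split: cut a sorted key list at its midpoint into two ranges."""
    mid = len(sorted_keys) // 2
    return sorted_keys[:mid], sorted_keys[mid:]


# The table from the question: 1M rows, 32-byte keys, two 32-byte columns.
size = estimated_table_bytes(1_000_000, 32, 32, 2)
needs_split = size > MAX_REGION_BYTES
print(f"estimated size: {size / 2**20:.0f} MiB, split needed: {needs_split}")

# Random keys still sort into one ordered range, so a split yields two
# contiguous halves of the key space.
keys = sorted(os.urandom(32) for _ in range(1000))
lower, upper = split_region(keys)
print(len(lower), len(upper), lower[-1] <= upper[0])
```

Under these assumptions the estimate stays below the threshold, so the whole table sits in one region on one node until it grows past the limit.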
Keys are sorted in a region. When a region splits, the new regions are about half the size of the original region, and contain half the key space each.

Dave

-----Original Message-----
From: Peter Haidinyak [mailto:[email protected]]
Sent: Friday, April 22, 2011 10:41 AM
To: [email protected]
Subject: Row Key Question

I have a question on how HBase decides to save rows based on Row Keys. Say I have a million rows to insert into a new table in a ten node cluster. Each row's key is some random 32 byte value and there are two columns per row, each column contains some random 32 byte value. My question is how does HBase know when to 'split' the table between the ten nodes? Or how does HBase 'split' the random keys between the ten nodes?

Thanks

-Pete
