I think yes, because it will avoid hotspotting. I think we have a good post on that topic on the Sematext blog.

Otis
--
Performance Monitoring - http://sematext.com/spm

On Sep 12, 2012 3:08 AM, "Ramasubramanian" <[email protected]> wrote:

> Hi thanks! But for loading data into hbase, adding hash in rowkey will
> improve performance?
>
> Regards,
> Rams
>
> On 12-Sep-2012, at 8:38 AM, lars hofhansl <[email protected]> wrote:
>
> > It depends. If you do not need to perform rangescans along (prefixes of)
> > your row keys, you can prefix the row key by a hash of the row key.
> > That will give you a more or less random distribution of the keys and
> > hence not hit the same region server over and over.
> >
> > You'll probably also want to presplit your table then.
> >
> > -- Lars
> >
> > ----- Original Message -----
> > From: Ramasubramanian <[email protected]>
> > To: [email protected]
> > Cc:
> > Sent: Tuesday, September 11, 2012 10:39 AM
> > Subject: Regarding rowkey
> >
> > Hi,
> >
> > What can be used as rowkey to improve performance while loading into
> > hbase? Currently I am having sequence. It takes some 11 odd minutes to load
> > 1 million record with 147 columns.
> >
> > Regards,
> > Rams
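
For illustration, here is a rough sketch of the hash-prefix idea discussed in this thread. It is plain Python and does not talk to HBase at all; it only builds keys and simulates which presplit key range each one would fall into. The function names, the 2-byte prefix width, the choice of SHA-256 and the 8-region split are all assumptions made for the example, not anything HBase prescribes.

```python
import bisect
import hashlib
from collections import Counter

PREFIX_BYTES = 2  # assumed prefix width; any small fixed width would do


def hashed_row_key(row_key, prefix_bytes=PREFIX_BYTES):
    """Prefix the original key with the first bytes of its SHA-256 digest."""
    raw = row_key.encode("utf-8")
    return hashlib.sha256(raw).digest()[:prefix_bytes] + raw


def split_points(num_regions, prefix_bytes=PREFIX_BYTES):
    """Evenly spaced split keys over the prefix space, for presplitting."""
    space = 256 ** prefix_bytes
    return [(i * space // num_regions).to_bytes(prefix_bytes, "big")
            for i in range(1, num_regions)]


def region_index(key, splits):
    """Index of the simulated region whose [start, end) range holds key."""
    return bisect.bisect_right(splits, key)


def load_distribution(keys, num_regions):
    """Count how many of the given byte keys land in each simulated region."""
    splits = split_points(num_regions)
    return Counter(region_index(k, splits) for k in keys)


if __name__ == "__main__":
    sequence = [str(i) for i in range(1, 10001)]  # sequence-style row keys
    plain = load_distribution((k.encode("utf-8") for k in sequence), 8)
    hashed = load_distribution((hashed_row_key(k) for k in sequence), 8)
    print("plain :", sorted(plain.items()))
    print("hashed:", sorted(hashed.items()))
```

With these simulated splits the plain decimal keys all fall into a single range, while the hashed keys spread across all eight. Because the prefix is derived from the key itself, a single-row read can recompute it from the original key; range scans over the original key order no longer work, which is the trade-off Lars mentions.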
