The thread ended abruptly..Can some suggest a solution for improved random read performance...Even we are facing a similar issue with multi-gets on a table with growing data...
On Wed, Apr 17, 2013 at 6:03 PM, Michel Segel <[email protected]>wrote: > Wouldn't do that... Changing block size is the last thing you want to do. > > First question... > > What is your key? > > Second... > What is your record size that you are attempting to read. > > > Third... > Compare the 10k multiget versus 10k individual gets. > > Fourth are your random keys sorted? > If not, try sorting them... > > There are a lot of issues that can affect performance .... > > > > Sent from a remote device. Please excuse any typos... > > Mike Segel > > On Apr 15, 2013, at 3:17 AM, Anoop Sam John <[email protected]> wrote: > > > Ankit > > I guess you might be having default HFile block size > which is 64KB. > > For random gets a lower value will be better. Try will some thing like > 8KB and check the latency? > > > > Ya ofcourse blooms can help (if major compaction was not done at the > time of testing) > > > > -Anoop- > > ________________________________________ > > From: Ankit Jain [[email protected]] > > Sent: Saturday, April 13, 2013 11:01 AM > > To: [email protected] > > Subject: HBase random read performance > > > > Hi All, > > > > We are using HBase 0.94.5 and Hadoop 1.0.4. > > > > We have HBase cluster of 5 nodes(5 regionservers and 1 master node). Each > > regionserver has 8 GB RAM. > > > > We have loaded 25 millions records in HBase table, regions are pre-split > > into 16 regions and all the regions are equally loaded. > > > > We are getting very low random read performance while performing multi > get > > from HBase. > > > > We are passing random 10000 row-keys as input, while HBase is taking > around > > 17 secs to return 10000 records. > > > > Please suggest some tuning to increase HBase read performance. > > > > Thanks, > > Ankit Jain > > iLabs > > > > > > > > -- > > Thanks, > > Ankit Jain > -- Thank you Kiran Sarvabhotla -----Even a correct decision is wrong when it is taken late
