Optimizations for random read performance

James Baldassari Mon, 15 Feb 2010 15:46:31 -0800

Hi,

Does anyone have any tips to share regarding optimization for random
read performance?  For writes I've found that setting a large write
buffer and setting auto-flush to false on the client side significantly
improved put performance.  Are there any similar easy tweaks to improve
random read performance?


I'm using HBase 0.20.3 in a very read-heavy real-time system with 1
master and 3 region servers.  It was working ok for a while, but today
there was a severe degradation in read performance.  Restarting Hadoop
and HBase didn't help, are there are no errors in the logs.  Read
performance starts off around 1,000-2,000 gets/second but quickly
(within minutes) drops to around 100 gets/second.

I've already looked at the performance tuning wiki page.  On the server
side I've increased hbase.regionserver.handler.count from 10 to 100, but
it didn't help.  Maybe this is expected because I'm only using a single
client to do reads.  I'm working on implementing a client pool now, but
I'm wondering if there are any other settings on the server or client
side that might improve things.

Thanks,
James

Optimizations for random read performance

Reply via email to