Hi Lohit, I'm running HBase 0.90.2. 10 x ec2 m1.large instances. I set the heap size to 4GB and handler count for hbase, and dfs to 100. I also set the dfs max xcievers to 4096
I'm running a pure random read YCSB workload. I also tried running multiple clients from multiple ec2 instances, but that just degrades the throughput of each client. I also tried increasing the number of threads and it doesn't seem to help. Below is the output I get from YCSB: YCSB Client 0.1 Command line: -t -db com.yahoo.ycsb.db.HBaseClient -P workloads/workloadstar-100_0 -p columnfamily=data -p operationcount=120000 -s -threads 50 -target 600 [OVERALL], RunTime(ms), 246398.0 [OVERALL], Throughput(ops/sec), 487.01694007256555 [READ], Operations, 120000 [READ], AverageLatency(ms), 70.07661666666667 [READ], MinLatency(ms), 0 [READ], MaxLatency(ms), 2779 [READ], 95thPercentileLatency(ms), 393 [READ], 99thPercentileLatency(ms), 855 [READ], Return=0, 120000 Thanks, Harold --- On Mon, 5/30/11, lohit <[email protected]> wrote: > From: lohit <[email protected]> > Subject: Re: How to improve HBase throughput with YCSB? > To: [email protected] > Date: Monday, May 30, 2011, 3:38 PM > Hello Harold, > > Can you share with us what kind of throughput you are > seeing. > Number of ops/sec and read latency you are seeing. > Also, what version of hbase are you running. > > Thanks, > Lohit > > 2011/5/30 Harold Lim <[email protected]> > > > Hi All, > > > > I have an HBase cluster on ec2 m1.large instance (10 > region servers). I'm > > trying to run a read-only YCSB workload. It seems that > I can't get a good > > throughput. It saturates to around 600+ operations per > second. > > > > My dataset is around 200GB (~1k+ regions). Running > major compaction and > > also setting the handler count to 100 helped improve > the performance a > > little bit. > > > > Are there setting or configurations that I need to > set? > > > > Thanks, > > Harold > > > > > > > -- > Have a Nice Day! > Lohit >
