Hi Lohit,

I'm running HBase 0.90.2. 10 x ec2 m1.large instances. I set the heap size to 
4GB and handler count for hbase, and dfs to 100. I also set the dfs max 
xcievers to 4096

I'm running a pure random read YCSB workload.

I also tried running multiple clients from multiple ec2 instances, but that 
just degrades the throughput of each client. I also tried increasing the number 
of threads and it doesn't seem to help.

Below is the output I get from YCSB:

YCSB Client 0.1
Command line: -t -db com.yahoo.ycsb.db.HBaseClient -P 
workloads/workloadstar-100_0 -p columnfamily=data -p operationcount=120000 -s 
-threads 50 -target 600
[OVERALL], RunTime(ms), 246398.0
[OVERALL], Throughput(ops/sec), 487.01694007256555
[READ], Operations, 120000
[READ], AverageLatency(ms), 70.07661666666667
[READ], MinLatency(ms), 0
[READ], MaxLatency(ms), 2779
[READ], 95thPercentileLatency(ms), 393
[READ], 99thPercentileLatency(ms), 855
[READ], Return=0, 120000

Thanks,
Harold


--- On Mon, 5/30/11, lohit <[email protected]> wrote:

> From: lohit <[email protected]>
> Subject: Re: How to improve HBase throughput with YCSB?
> To: [email protected]
> Date: Monday, May 30, 2011, 3:38 PM
> Hello Harold,
> 
> Can you share with us what kind of throughput you are
> seeing.
> Number of ops/sec and read latency you are seeing.
> Also, what version of hbase are you running.
> 
> Thanks,
> Lohit
> 
> 2011/5/30 Harold Lim <[email protected]>
> 
> > Hi All,
> >
> > I have an HBase cluster on ec2 m1.large instance (10
> region servers). I'm
> > trying to run a read-only YCSB workload. It seems that
> I can't get a good
> > throughput. It saturates to around 600+ operations per
> second.
> >
> > My dataset is around 200GB (~1k+ regions). Running
> major compaction and
> > also setting the handler count to 100 helped improve
> the performance a
> > little bit.
> >
> > Are there setting or configurations that I need to
> set?
> >
> > Thanks,
> > Harold
> >
> >
> 
> 
> -- 
> Have a Nice Day!
> Lohit
> 

Reply via email to