On Fri, May 6, 2011 at 11:19 AM, pob <[email protected]> wrote:

> > The data for those regions is replicated, but only 1 region server
> > does the management of that data.
>
> So does that mean there isn't "scaling for reads"? {meaning more replicas ->
> better read throughput}

Reads are scaled by splitting regions and distributing them around multiple
servers. If you have one super-hot row, it should fit in cache and give you
some >20k reads/second. If you need more reads/sec on a single row than that,
you'll need to add your own caching layer in front.

-Todd

--
Todd Lipcon
Software Engineer, Cloudera
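
For illustration, a minimal sketch of what "your own caching layer in front" could look like: a small read-through cache with a short TTL that sits between the application and the store. Everything here is an assumption for the example, not HBase API: `ReadThroughCache`, `fetch_row`, and `ttl_seconds` are hypothetical names, and `fetch_row` stands in for whatever client call actually reads the row.

```python
import threading
import time
from typing import Callable, Dict, Tuple


class ReadThroughCache:
    """Tiny TTL read-through cache for a handful of very hot rows.

    fetch_row is a hypothetical callable that reads one row from the
    backing store; it is NOT an HBase client API.
    """

    def __init__(self,
                 fetch_row: Callable[[str], bytes],
                 ttl_seconds: float = 1.0,
                 clock: Callable[[], float] = time.monotonic):
        self._fetch_row = fetch_row
        self._ttl = ttl_seconds
        self._clock = clock  # injectable so expiry can be tested
        self._lock = threading.Lock()
        # row key -> (expiry time, cached value)
        self._entries: Dict[str, Tuple[float, bytes]] = {}

    def get(self, row_key: str) -> bytes:
        now = self._clock()
        with self._lock:
            entry = self._entries.get(row_key)
            if entry is not None and now < entry[0]:
                return entry[1]  # fresh hit: no read reaches the store
        # Miss or expired entry: read through to the backing store.
        # Concurrent misses on the same key may each fetch once; that
        # is acceptable for a sketch and keeps the lock hold time short.
        value = self._fetch_row(row_key)
        with self._lock:
            self._entries[row_key] = (now + self._ttl, value)
        return value

    def invalidate(self, row_key: str) -> None:
        """Drop a cached row, e.g. right after the application writes it."""
        with self._lock:
            self._entries.pop(row_key, None)
```

With a TTL of one second, a single hot row costs the store roughly one read per second per cache instance, however many reads the application serves, at the price of results that can be up to one TTL stale.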
