This should be per region server.

Cheers

On Wed, Sep 7, 2016 at 7:19 AM, Sreeram <[email protected]> wrote:

> Hi Ted,
>
> From the link
> "Around 50-100 regions is a good number for a table with 1 or 2 column
> families. Remember that a region is a contiguous segment of a column
> family.".
>
> This number 50-100 regions per table at the level of individual region
> server or for the entire cluster ?
>
> Thanks,
> Sreeram
>
>
>
>
>
> On Wed, Sep 7, 2016 at 4:18 PM, Ted Yu <[email protected]> wrote:
>
> > With properly designed schema, you don't need to split the cluster.
> >
> > Please see:
> > http://hbase.apache.org/book.html#schema
> >
> > > On Sep 7, 2016, at 1:59 AM, Sreeram <[email protected]> wrote:
> > >
> > > Dear All,
> > >
> > >
> > >
> > > Looking forward to your views on the maximum limit of HBase cluster
> size.
> > >
> > >
> > >
> > > We are currently designing a HBase cluster and one of the tables
> > (designed
> > > in wide format) is expected to have roughly 6 billion rows in
> production
> > by
> > > 3 years (with an additional 200 million rows getting added each month).
> > In
> > > addition, we are expecting roughly 250 columns per row.  Expected table
> > > data volume is around 250 TB (at end of 3 years, without considering
> HDFS
> > > replication) and growing by 7 TB per month.
> > >
> > >
> > >
> > > While we are provisioning the number of nodes based on expected data
> > > volume, wanted to check if there are any limits on the number of rows
> per
> > > cluster.
> > >
> > >
> > >
> > > Will it be advisable to split the cluster in such situation into two or
> > > more independent clusters?  Will there be any impact to the read/write
> > > throughput/latency as the table grows over time?
> > >
> > >
> > >
> > > Please advise.
> > >
> > >
> > >
> > > Regards,
> > >
> > > Sreeram
> >
>

Reply via email to