This information isn’t publicly available to us either. I think our experience is we have a distribution of cluster sizes that fall along three orders of magnitude:
10-100: Most start here. 100-1000: Mid size and enterprise. 1000-10000: Peak scale internet. I think a cluster of size 5000 nodes or larger hasn’t ever been done but like I said we don’t really know. Most organizations that use HBase at scale run more than one cluster, so the total amount under storage can be in the peak scale range (1k-10k) but the largest particular cluster is more like 100-500 nodes (my employer falls into this bucket). A context free discussion on cluster sizing is rarely useful. What you might need, and if you even need something like HBase, is going to depend on your requirements. Basically: Try everything else first, and when you need us, you’ll know when they won’t work for you. On Thu, Mar 21, 2019 at 8:35 AM dongda...@gmail.com <dongda...@gmail.com> wrote: > Hi, Experts, > Does anybody know what is the largest HBase cluster ever used by any > company? The only info that we can get is a few PB of size in year 2011 by > google. But what is the current status after so many years of technical > evolvement of HBase(e.g., can HBase support ~100PB storage?)? Thanks so > much! > > Cheers, > Dan > -- Best regards, Andrew Words like orphans lost among the crosstalk, meaning torn from truth's decrepit hands - A23, Crosstalk