I have come across clusters with 100s of tables but that typically is
due to a sub optimal table design.

The question here is - why do you need to distribute your data over
lots of tables? What's your access pattern and what kind of data are
you putting in? Or is this just a theoretical question?

On Jul 13, 2012, at 12:05 AM, Adrien Mogenet <[email protected]> wrote:

> Hi there,
>
> I read some good practices about number of columns / column families, but
> nothing about the number of tables.
> What if I need to spread my data among hundred or thousand (big) tables ?
> What should I care about ? I guess I should keep a tight number of
> storeFiles per RegionServer ?
>
> --
> Adrien Mogenet
> http://www.mogenet.me

Reply via email to