I have come across clusters with hundreds of tables, but that is typically due to a suboptimal table design.
The question here is - why do you need to distribute your data over lots of tables? What's your access pattern and what kind of data are you putting in? Or is this just a theoretical question?

On Jul 13, 2012, at 12:05 AM, Adrien Mogenet <[email protected]> wrote:

> Hi there,
>
> I read some good practices about number of columns / column families, but
> nothing about the number of tables.
> What if I need to spread my data among hundred or thousand (big) tables ?
> What should I care about ? I guess I should keep a tight number of
> storeFiles per RegionServer ?
>
> --
> Adrien Mogenet
> http://www.mogenet.me
