Alex,

In HBase it really makes more sense to put all the data you can in a
single table as it will be automatically partitioned and distributed
across the region servers (providing you have more than 256MB of
data).

J-D

On Wed, Nov 11, 2009 at 3:24 AM, TuxRacer69 <tuxrace...@gmail.com> wrote:
> Hello Hbase Users,
>
> I'm trying to decide what database schema is better, more precisely how to
> partition my data.
>
> Is it better to have few tables with a lot of keys
> OR
> a lot of tables with fewer keys?
>
> For instance if I want to store are articles written by user
> Say we have U users (U=100000) and each user 'u' writes A_u articles (in
> average 10000 articles per user)
>
> Is it better to create a table per user
> e.g. table "articles_for_user_1"
> and in that table a column family 'article'
> with key based on the date of the article YYYMMDDMMss
>
> or one large table called "articles"
> whith one column family 'article'
> and keys based on user ID + date: user1_YYYMMDDMMss
>
> If I make one table per user, do I have the risk of hitting the nodes memory
> limits if the number of users grows?
> If I just have one big table, will search by key be to slow?
> I think this question reveals my lack of knowledge of of how Hbase stores
> the data...
>
>
> Thanks
> Alex
>

Reply via email to