Tall versus wide tables in Hbase

Usman Waheed Fri, 18 Feb 2011 00:17:02 -0800

Hi,

I would like to setup an Hbase table that would provide users the abilityto perform selects only (get and scans). We don't have a need for users toperform inserts or updates at the moment. But yes i will have toload/insert the data into the tables before users can perform selects.

I can have the row key as a composite, having "brand:date:users" wherebrand is a 4 letter code for all brands, date is DD-MM-YYYY and users isthe metric (how many people bought a certain brand). This will give merather tall table which will have millions of rows and less columns (maybe2) at most.

or

Would it be better to have a wider table with the row key as users:dateonly and have the brands become a column family. There are many brands totrack on a daily basis. People using my table will need to select aparticular brand, a group or all brands to retrieve and display data.

If i recollect is it recommended to have tall tables if one is not doingatomic operations? Does a get/scan in Hbase perform any row locking?Having a tall table means more data can be spread out over regions ondifferent nodes in my cluster. I have a small test cluster of 3 nodes atthe moment.

I intend to have other metrics (quantity, price etc) and types (brand,products, campaigns etc). So my table will be gorw fast and have lots ofdata.

If i use the type (brand, campaign, product) as part of the row key thenmy inserts will be in the millions over time but if i make the type acolumn family then i will end up with wider entries and less rows.


Thanks,
Usman






--
Using Opera's revolutionary email client: http://www.opera.com/mail/

Tall versus wide tables in Hbase

Reply via email to