Hi,

i'm pretty interested in learning hbase.  what i want to do is store
financial data for analytical/graphing/displaying purposes.  there hundreds
of millions of rows and of course, i want fast response when retrieving the
data.

if i were to do it in a RDBMS it would be
REPORT, MARKET, OPERATING_DATE, OPERATING_INTERVAL,     HOUR_ENDING     VALUE
where the bolded column name are PK.  if i were to store this in hbase would
it look like this?

REPORT.MARKET.OPERATING_DATE.OPERATING_INTERVAL.HOUR_ENDING.TIMESTAMP{
        VALUE: 92.29
}

so that i can do queries like below:
- give me all reports with the name of "ABC"
- give me all the values where OPERATING_DATE is from jan-01-2010 to
jan-10-2010
- give me all the values where OPERATING_DATE is from jan-01-2010 to
jan-10-2010 and HOUR_ENDING is between 5 and 10 (or simply 5 or variations
thereof)

in short, is hbase the wrong way to go about it or would it yield better
performance?  also, you folks happen to know any good links/articles on
hbase table & schema?

thanks
-- 
View this message in context: 
http://old.nabble.com/learning-hbase---schema-design-advice-tp27252203p27252203.html
Sent from the HBase User mailing list archive at Nabble.com.

Reply via email to