I found this to be very helpful: http://data-tactics.com/techtips/cloud_data_structure_diagramming.pdf
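
On the row key in the quoted question below: HBase keeps rows sorted lexicographically by row key, and a scan takes an inclusive start row and an exclusive stop row. A composite key like the one proposed can therefore serve the "report" and "date range" queries as contiguous scans, as long as each field is encoded so that string order matches logical order. What follows is only a rough sketch of that idea in plain Python, with no HBase client involved. The sorted dict stands in for a table, and the helper names (row_key, range_scan, prefix_scan), the "EAST" market, the ISO date format, the integer interval, and the two-digit zero padding are all assumptions made for illustration, not anything taken from the original post. It also assumes that no field value contains the "." separator.

```python
from bisect import bisect_left
from datetime import date

SEP = "."  # assumed separator; field values must not contain it


def row_key(report, market, operating_date, interval, hour_ending):
    """Build a composite key whose string order matches the logical order."""
    return SEP.join([
        report,
        market,
        operating_date.isoformat(),  # YYYY-MM-DD sorts chronologically
        f"{interval:02d}",           # zero-pad so 10 sorts after 9
        f"{hour_ending:02d}",
    ])


def range_scan(table, start, stop):
    """Return (key, value) pairs with start <= key < stop, in key order."""
    keys = sorted(table)
    lo = bisect_left(keys, start)
    hi = bisect_left(keys, stop)
    return [(k, table[k]) for k in keys[lo:hi]]


def prefix_scan(table, prefix):
    """Return every (key, value) pair whose key begins with prefix."""
    # Stop key: the prefix with its last character bumped by one code point.
    stop = prefix[:-1] + chr(ord(prefix[-1]) + 1)
    return range_scan(table, prefix, stop)


def hour_ending_of(key):
    """Extract the trailing HOUR_ENDING field from a composite key."""
    return int(key.rsplit(SEP, 1)[1])


# Hypothetical sample data: 12 days x 4 hours for report ABC, one row for XYZ.
table = {}
for day in range(1, 13):
    for hour in (4, 5, 10, 11):
        table[row_key("ABC", "EAST", date(2010, 1, day), 1, hour)] = 92.29
table[row_key("XYZ", "EAST", date(2010, 1, 5), 1, 5)] = 10.0

# Query 1: all rows for report "ABC" (prefix scan on the leading field).
abc_rows = prefix_scan(table, "ABC" + SEP)

# Query 2: OPERATING_DATE from 2010-01-01 through 2010-01-10 for one market.
jan_rows = range_scan(table, "ABC.EAST.2010-01-01", "ABC.EAST.2010-01-11")

# Query 3: same date range, HOUR_ENDING between 5 and 10. HOUR_ENDING is the
# last key field, so matching rows are not contiguous; filter after the scan.
hour_rows = [(k, v) for k, v in jan_rows if 5 <= hour_ending_of(k) <= 10]
```

The third query is the one to watch. Because HOUR_ENDING trails the date in the key, an hour filter has to be applied on top of the date-range scan (or the key reordered) instead of being expressed as a single start/stop range. The same applies to a date-range query that does not fix REPORT and MARKET first.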

-----Original Message-----
From: canucks [mailto:anh...@gmail.com]
Sent: Wednesday, January 20, 2010 10:12 PM
To: hbase-user@hadoop.apache.org
Subject: learning hbase - schema design advice

Hi,

I'm pretty interested in learning HBase. What I want to do is store financial data for analytical/graphing/display purposes. There are hundreds of millions of rows and, of course, I want a fast response when retrieving the data.

If I were to do it in an RDBMS, it would be

REPORT, MARKET, OPERATING_DATE, OPERATING_INTERVAL, HOUR_ENDING, VALUE

where the bolded column names are the PK.

If I were to store this in HBase, would it look like this?

REPORT.MARKET.OPERATING_DATE.OPERATING_INTERVAL.HOUR_ENDING.TIMESTAMP{ VALUE: 92.29 }

so that I can do queries like the ones below:

- give me all reports with the name of "ABC"
- give me all the values where OPERATING_DATE is from jan-01-2010 to jan-10-2010
- give me all the values where OPERATING_DATE is from jan-01-2010 to jan-10-2010 and HOUR_ENDING is between 5 and 10 (or simply 5, or variations thereof)

In short, is HBase the wrong way to go about it, or would it yield better performance?

Also, do you folks happen to know any good links/articles on HBase table and schema design?

Thanks

--
View this message in context: http://old.nabble.com/learning-hbase---schema-design-advice-tp27252203p27252203.html
Sent from the HBase User mailing list archive at Nabble.com.