This is an HBase schema design question. Suppose I store blog entry details
in an HBase table:
blogid, blog_content, blog_author, blog_subject.

My query is such that it always retrieves all this data at the same time.

So is it a better idea to store all of this in a single JSON/protobuf object, or
to actually separate out the details into column families?

Option1:

Table          RowKey         Column Family          Value
Blogs          BlogId         Details                JSON(Content, Author, Subject)

Option2:

Table          RowKey         Column Family
Blogs          BlogId         Content
                              Author
                              Subject
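
To make the two layouts concrete, here is a rough sketch in plain Python. It
does not use any HBase client; a dict stands in for the table, and each cell is
keyed by (column family, qualifier). The function names and the qualifiers
"json" and "value" are made up for illustration and are not part of any real
schema.

```python
import json

# Hypothetical in-memory stand-in for an HBase table:
#   row key -> {(column family, qualifier): value as bytes}
# Real HBase adds timestamps/versions per cell; they are left out here.


def put_option1(table, blog_id, content, author, subject):
    """Option 1: a single cell in family 'Details' holding one JSON blob."""
    payload = json.dumps(
        {"content": content, "author": author, "subject": subject}
    )
    table[blog_id] = {("Details", "json"): payload.encode("utf-8")}


def put_option2(table, blog_id, content, author, subject):
    """Option 2: one cell per field, each in its own column family."""
    table[blog_id] = {
        ("Content", "value"): content.encode("utf-8"),
        ("Author", "value"): author.encode("utf-8"),
        ("Subject", "value"): subject.encode("utf-8"),
    }


def get_option1(table, blog_id):
    """Read the single cell and deserialize the JSON blob."""
    raw = table[blog_id][("Details", "json")]
    return json.loads(raw.decode("utf-8"))


def get_option2(table, blog_id):
    """Read every cell of the row and rebuild the entry field by field."""
    row = table[blog_id]
    return {
        family.lower(): value.decode("utf-8")
        for (family, _qualifier), value in row.items()
    }
```

Either way the reader gets back the same three fields; the difference is one
cell to fetch and deserialize versus three cells spread over three families.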


I was thinking of option1 because it seems it might be faster, since all the
details would be physically stored together. But option2 seems to be the trend
in the other basic HBase schema examples I have looked at.

Please let me know your opinions and whether I am on the right track...

Thanks in advance.
