Hi Alex, 

Yes that is right. Before I can recommend either way I need to know how you 
access your data. How often do you access older versions and are you accessing 
them separately or are you reading multiple versions in one go?

Lars

On Nov 25, 2010, at 21:22, Nanheng Wu <[email protected]> wrote:

> Hi Lars,
> 
>  Thank you so much for the response. So if I understand correctly, if
> I want to use columns for my use-case I would keep adding columns to
> the row during each load where the column name has the version
> information, is that correct? And if I want to use row keys I can just
> append the version to the keys themselves? Considering that I will
> have a pretty large data to load everyday, and occasionally need to
> delete some older versions of data to save space, do you have some
> recommendation on which option might work better?
> 
> Thanks again,
> Alex
> 
> 
> On Thu, Nov 25, 2010 at 10:18 AM, Lars George <[email protected]> wrote:
>> Hi Alex,
>> 
>> Oh no, you do NOT want to use column families that way. The are semi static 
>> and should not be changed too often nor should there be too many. Adding a 
>> CF requires disabling the table too.
>> 
>> Use columns, row keys or timestamps for that use-case.
>> 
>> Lars
>> 
>> On Nov 25, 2010, at 17:31, Nanheng Wu <[email protected]> wrote:
>> 
>>> Hello,
>>> 
>>>  I am very new to HBase and I hope to get some feedback from the
>>> community on this: I want to use HBase to store some data with pretty
>>> simple structure: each key has ~50 attributes. These data are computed
>>> daily and loaded into HBase everyday. Almost all of the keys will have
>>> updated values for some attributes, some keys may be delete and some
>>> may be added. What I'd like to have is versioning on the dataset,
>>> HBase will only serve queries using one of the versions and I will
>>> have metadata to keep track of which version should be used. My
>>> question is should I use a ColumnFamily for each version? I would need
>>> to create new ColumnFamilies on every load, and occasionally remove
>>> them if they are too old. Are ColumnFamilies meant to be used this
>>> way?
>>> 
>>> Thanks!
>>> Alex
>> 

Reply via email to