Hi Jean-Daniel,

Thanks for the info on the HBase transaction model.

I'm not sure I quite understand your subsequent question though, and I'm interested in learning whether you are suggesting a preferred alternative to what we are doing.

To set the background, I should begin by noting that by the nature of our application, the table in this case will always be dense. We probably are not using HBase optimally in that regard, since such data could be stored in an RDBMS. However, we mainly want to use HBase because of its scale, and to run map-reduce across the data set for different computations where RDBMS data-mining tools would be overkill in some cases and inefficient in others.

We have used the transaction model described for several reasons. By creating a column family ("key") with numerous column members ("key:c1", "key:c2", etc.), we have a complex key for a second column family ("value") that we can filter on in the map phase of map-reduce computations. Each row in the HBase table corresponds to a data set for an entity in our system, which it seems would be stored relatively efficiently in HDFS. Timestamps and versions become relevant as indices for data whose temporal course is of interest. (Figure 1 in the seminal Google paper "Bigtable: A Distributed Storage System for Structured Data" is in fact a good picture of what we are doing --- except our rows are dense in the sense that every column has an entry for every timestamp, because of the nature of our data.)
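To make that layout concrete, here is a minimal sketch using plain Java collections rather than the HBase API; the column names and timestamps are hypothetical stand-ins for our actual schema:

```java
import java.util.*;

// Sketch of one dense row: every column has a value at every timestamp,
// with versions ordered newest-first as HBase returns them.
public class DenseRowSketch {
    static Map<String, NavigableMap<Long, String>> buildRow(String[] columns, long[] timestamps) {
        Map<String, NavigableMap<Long, String>> row = new TreeMap<>();
        for (String col : columns) {
            // newest timestamp first, as with versioned reads
            NavigableMap<Long, String> versions = new TreeMap<>(Collections.reverseOrder());
            for (long ts : timestamps) {
                versions.put(ts, col + "@" + ts);  // dense: an entry at every timestamp
            }
            row.put(col, versions);
        }
        return row;
    }

    public static void main(String[] args) {
        String[] columns = {"key:c1", "key:c2", "key:timestamp", "value:v"};
        long[] timestamps = {1000L, 2000L, 3000L};
        Map<String, NavigableMap<Long, String>> row = buildRow(columns, timestamps);
        for (Map.Entry<String, NavigableMap<Long, String>> e : row.entrySet()) {
            System.out.println(e.getKey() + " -> " + e.getValue().size()
                + " versions, newest ts=" + e.getValue().firstKey());
        }
    }
}
```

The point of the sketch is just that every (column, timestamp) pair is populated, which is what makes the per-row filtering in the map phase straightforward.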

The ability to ensure that all columns written in a single transaction are returned together also allows us to retrieve data sequences of interest to us in non-map-reduce environments. Here we have used the transaction model described because of the apparent difficulty of retrieving all versions of two different columns in a row, along with their timestamps, using the get(), getRow(), and obtainScanner() methods of the HTable v0.1.x native client. By including a "key:timestamp" column in the "key" column-family, we get an explicit timestamp value that can be combined with the row key as a key for fast retrieval of column values in the "value" column-family. In that sense we are using a column key (the "timestamp" column), which just happens to be a timestamp, to access other column values in the row.
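A rough sketch of that composite-key idea, with hypothetical names and a plain Java map standing in for the actual table:

```java
import java.util.*;

// Sketch: combine the row key with the explicit timestamp stored in a
// "key:timestamp" column to form a lookup key into the "value" family.
// All names here are hypothetical illustrations, not our real schema.
public class ExplicitTimestampKey {
    static String lookupKey(String rowKey, long explicitTs) {
        return rowKey + "/" + explicitTs;   // composite key: row + explicit timestamp
    }

    public static void main(String[] args) {
        // Store "value:" cells under composite keys built from explicit timestamps.
        Map<String, String> valueFamily = new HashMap<>();
        long[] explicitTimestamps = {1000L, 2000L};  // contents of "key:timestamp"
        for (long ts : explicitTimestamps) {
            valueFamily.put(lookupKey("row_num", ts), "v@" + ts);
        }
        // Fast retrieval: read "key:timestamp", then index directly into "value:".
        System.out.println(valueFamily.get(lookupKey("row_num", 2000L)));
    }
}
```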

Which actually brings me to another question. I think our model highlights something about "timestamps" and "versions" that is not quite clear. Namely, the Google figure and the model I've described implicitly assume that row key + timestamp forms a unique key for an entry in each column. Everywhere else I can find, "versions" generally refers to a set of entries in a column (row-column cell) where each value has a unique timestamp. But if the timestamp is not stored with sufficient resolution, e.g. 1 sec resolution, one can postulate a situation in which two put()s occur close enough in time that two entries in a cell could in theory have the same row key + timestamp. This suggests that the set of versions of a cell is not 1-1 with the set of unique timestamps on the cell's entries. Put another way, could one retrieve X versions of a row-column cell, but find only Y < X unique timestamps on those X versions? Is there any additional explanation of versions vis-a-vis timestamps you can point me to that would help sort this out?
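To illustrate the concern (this models the ambiguity in the abstract, not HBase's actual internals), here is a toy version store keyed by a 1-second-resolution timestamp; two writes landing in the same second collapse to one surviving version:

```java
import java.util.*;

// Illustration only: if the version store is keyed by a coarse timestamp,
// two put()s within the same second collide on the same key, so the count
// of distinct timestamps can undercount the intended number of versions.
public class TimestampCollision {
    static NavigableMap<Long, String> putAll(long[] millisTimes, String[] values) {
        NavigableMap<Long, String> cell = new TreeMap<>();
        for (int i = 0; i < millisTimes.length; i++) {
            long coarseTs = millisTimes[i] / 1000;   // truncate to 1 sec resolution
            cell.put(coarseTs, values[i]);           // same key => overwrite
        }
        return cell;
    }

    public static void main(String[] args) {
        // Three writes, the first two within the same second.
        long[] times = {1000L, 1500L, 3000L};
        String[] vals = {"a", "b", "c"};
        NavigableMap<Long, String> cell = putAll(times, vals);
        System.out.println("writes=3, surviving versions=" + cell.size());
        // prints "writes=3, surviving versions=2"
    }
}
```

Whether real HBase cells behave this way (overwrite, or keep both entries under one timestamp) is exactly the question.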

So finally, given all this, are you thinking about other ways to use column keys rather than timestamps we should be considering?

Thanks,
Rick

On Jul 21, 2008, at 8:23 AM, Jean-Daniel Cryans wrote:

Rick,

Yes and yes, but why not use the column keys instead of the timestamps?

J-D

On Sat, Jul 19, 2008 at 1:47 AM, Rick Hangartner <[EMAIL PROTECTED] >
wrote:

Hi,

This is a question about the HBase transaction model.

Suppose I have a table with two columns, "c1" and "c2". Now assume that for each timestamp in each row, I have an entry in each column. That is, assume the table is ALWAYS written such that it is "dense" (i.e. a complete relation)
rather than sparse (i.e. a partial relation), using the transaction
semantics:

  HTable table = new HTable(conf, new Text("test"));
  static final Text rowId = new Text("row_num");
  static final Text col1Id = new Text("c1");
  static final Text col2Id = new Text("c2");

  long lockid = table.startUpdate(rowId);

  // always write all columns in a row
  table.put( lockid, col1Id, val );
  table.put( lockid, col2Id, val );

  table.commit( lockid, timestamp );

Also assume that the two columns are read as something like:

  byte[][] c1Vals = table.get( rowId, col1Id, versions );
  byte[][] c2Vals = table.get( rowId, col2Id, versions );


Is it guaranteed that for each index value i, c1Vals[i][] and c2Vals[i][] are the two column entries originally written with the same timestamp?

Also, is something like:

  byte[][] c1Vals = table.get( rowId, col1Id, MAX_VALUE );

sufficient to guarantee all versions are returned in the "get" operations?

Thanks,
Rick



