Re: Omid: Transactional Support for HBase

Flavio Junqueira Sun, 06 Nov 2011 12:54:08 -0800

A quick note on Omid for the ones following on github: the repositorywe will be working with is the fork under the Yahoo! account:


https://github.com/yahoo/omid/


-Flavio

On Nov 5, 2011, at 9:36 PM, Daniel Gómez Ferro wrote:

On Nov 5, 2011, at 05:37 , lars hofhansl wrote:
Cool stuff Daniel,
Hi Lars,

Thanks for the good points.
Was looking through the code a bit. Seems like you make a besteffort to push as much ofthe filtering of KVs of uncommitted transactions to HBase and thendo some filtering on the clientnot a bad approach. (I hope I didn't misunderstand the approach,only looked through the code for
1/2 hour or so).
Putting it more accurately, the uncommitted KVs are stored at HBase,but it is the client's job to filter them using the commitinformation that it has received from the status oracle. Accordingto snapshot isolation guarantee, all the versions that are insertedwith a timestamp larger than the transaction start timestamp must beignored, which is done by setting the time range on the client's getrequest sent to HBase. Since the uncommitted changes of the abortedtransactions are eventually removed from HBase, the client rarelyneeds to fetch more than a version to reach a KV that is committedbefore the transaction starts (the first property of snapshotisolation).
One thing I was wondering: Why bookkeeper? Why not store the WALitself in HBase? That way
you might not even need a separate server.
Did you see: HBaseSI (http://www.cs.uwaterloo.ca/~c15zhang/HBaseSI.pdf), they also do MVCCon top of unaltered HBase/schema, although from reading that paperI get the impression that itwould not scale to scans touching many rows (which is where yourclient side filtering comes in).
Thanks for the link. We had seen the other paper of the same authors(Grid2010) that shares the same bottlenecks with the recent work.As you pointed out correctly, the question is about performance. Youcould see the scalability bottleneck of 400 TPS in the evaluationsection of this paper. Our approach, however, provides snapshotisolation with a negligible overhead on region servers, and couldscale up to tens of thousands write transactions per second. If youare interested, a summary of techniques that we used to achieve thisperformance is published at SOSP'11, poster section.
http://sigops.org/sosp/sosp11/posters/summaries/sosp11-final12.pdf
-- Lars


----- Original Message -----
From: Daniel Gómez Ferro <[email protected]>
To: "[email protected]" <[email protected]>; "[email protected]" <[email protected]>Cc: Maysam Yabandeh <[email protected]>; Flavio Junqueira <[email protected]>; Benjamin Reed <[email protected]>; Ivan Kelly <[email protected]>
Sent: Friday, November 4, 2011 4:24 AM
Subject: Omid: Transactional Support for HBase

(I apologize for resending but I forgot to add the user list.)

Hi all,
It is my pleasure to announce the open source release of Omid, aproject whose goal is to add lock-free transactional support on topof HBase. The current release includes CrSO, a client-replicatedstatus oracle that detects the write-write conflicts to provideSnapshot Isolation. CrSO has the following appealing properties:
1) It does not need any modification into the HBase code nor thetable scheme.2) The overhead on HBase DataNodes is negligible (only after anabort)3) It scales up to 50,000 write transactions per second (TPS) and athousand of client connections.
We have setup a github project: https://github.com/dgomezferro/omid

More information is available at the wiki: 
https://github.com/dgomezferro/omid/wiki
If you are interested, installation and running instructions areavailable on the README: https://github.com/dgomezferro/omid/blob/master/README.md
Please do not hesitate to contact us in the case of any question.

Best Regards,
Daniel Gómez Ferro


flavio
junqueira

research scientist

[email protected]
direct +34 93-183-8828

avinguda diagonal 177, 8th floor, barcelona, 08018, es
phone (408) 349 3300    fax (408) 349 3301

Re: Omid: Transactional Support for HBase

Reply via email to