Edward, We're working on a user-facing web system backed by Hbase. More read-oriented than a mail system, but it does also have web users writing to it. We're making heavy use of memcached because HBase random read is not fast enough. Haven't tried BLOCKCACHE yet, but reading a random row from HBase generally costs us about 150ms, which when multiplied by 10-20 records is expensive. We think it's this slow because of the quantity of data we're transporting, but haven't fully figured it out yet -- MySQL and memcached can deliver the same quantity of data in 1/10th the time. If you can model your data to favour reading with scanners instead of randomly, I'm sure you could do much better. I know that the scanner code was recently optimized with a batching strategy.
We're using Solr/Lucene for secondary indexes & searching. We often display indexed results instead of retrieving data from the database. We generally do only one HBase getRow call per user HTTP request, the rest comes from Solr or memcached. We haven't rolled out beyond a small alpha user group, so the system is not proven in the real world. Like Stack says: try it and see what happens. And be prepared to switch to an ugly MySQL sharding approach if it doesn't work out. j On Tue, Nov 18, 2008 at 9:21 PM, Edward J. Yoon <[EMAIL PROTECTED]>wrote: > Does anyone have some opinion about this? > > On Tue, Nov 18, 2008 at 11:18 AM, Edward J. Yoon <[EMAIL PROTECTED]> > wrote: > > Hi, > > > > I'm considering to store the large-scale web-mail data on the Hbase. > > IMO, I expect to be able to solve both real-time and batch (e.g. spam > > filtering, from/to graph, ..., etc) issues. But I'm still not sure > > whether it's suitable for storing web mail data. The stable online > > real-time service should be possible to be a web mail service. > > > > Does anyone tried similar one (real-time application), Or know about > > gmail architecture? > > Any advices are welcome, Thanks! > > > > -- > > Best Regards, Edward J. Yoon @ NHN, corp. > > [EMAIL PROTECTED] > > http://blog.udanax.org > > > > > > -- > Best Regards, Edward J. Yoon @ NHN, corp. > [EMAIL PROTECTED] > http://blog.udanax.org >
