Re: 1TB memcached

Matt Ingenthron Wed, 22 Sep 2010 13:20:16 -0700

 On 9/22/10 10:23 AM, Les Mikesell wrote:

On 9/22/2010 11:59 AM, Matt Ingenthron wrote:

On 9/22/10 6:12 AM, ligerdave wrote:

MongoDB is actually a "cached" db, meaning that most of its records are
in memory.


I think there is also a memcached and DB hybrid which comes w/ a
persistent option. I think it's called memcachedDB, which runs an
in-memory db (like mongodb). It shares most of its API w/ memcached,
so you don't have to change code very much.


membase is compatible with the memcached protocol, has a 20MByte default
object size limit, and lets you define memory and disk usage across nodes
in different "buckets".

memcacheDB is challenging to deploy for a few reasons, one of which is
that the topology is fixed at deployment time.

Does anyone know how these would compare to 'riak', a distributed database that can do redundancy with some fault tolerance and knows how to rebalance the storage across nodes when they are added or removed? (Other than the different client interface...).


This is a very detailed question, but...

Without going too much into advocacy (I'd defer you to the membase list/site), membase does have redundancy, fault tolerance and can rebalance when nodes are added and removed. The interface to membase is memcached protocol. It does so by making sure there is an authoritative place for any given piece of data at any given point in time. That doesn't mean data's not replicated or persisted, just that there are rules about the state changes for a given piece of data based on vbucket hashing and a shared configuration.
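
To make the "one authoritative place" idea concrete, here is a minimal sketch of vbucket-style hashing. It is illustrative only: the vbucket count, the CRC32 hash, the round-robin map, and every name in it (NUM_VBUCKETS, vbucket_for, build_map, node_for) are assumptions made for the example, not membase's actual implementation.

```python
import zlib

# Assumed value: the number of vbuckets is fixed and independent of
# how many nodes are in the cluster.
NUM_VBUCKETS = 1024


def vbucket_for(key: str) -> int:
    """Hash a key to a vbucket id (CRC32 is an illustrative choice)."""
    return zlib.crc32(key.encode("utf-8")) % NUM_VBUCKETS


def build_map(nodes):
    """Shared configuration: vbucket id -> its single authoritative node.

    Round-robin assignment is a simplification for this sketch.
    """
    return [nodes[v % len(nodes)] for v in range(NUM_VBUCKETS)]


def node_for(key: str, vbucket_map) -> str:
    """Every client holding the same map resolves a key to the same node."""
    return vbucket_map[vbucket_for(key)]


if __name__ == "__main__":
    cluster = build_map(["node-a", "node-b", "node-c"])
    print(vbucket_for("user:42"), node_for("user:42", cluster))
```

The point of the indirection is that a key's vbucket id never changes. Adding or removing a node only changes the shared map, so a rebalance can move whole vbuckets between nodes while each one still has exactly one owner at a time.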

This was actually inspired by similar concepts that were in memcached's codebase up through the early 1.2.x, but not in use anywhere that I'm familiar with.

riak is designed more around eventual consistency and lots of tuning of W+R>N (W write acknowledgements and R read responses out of N replicas), meaning that it is designed more to always take writes and deal with consistency for reads by doing multiple reads. This is different from memcached in that memcached expects one and only one location for a given piece of data with a given topology. If the topology changes (node failures, additions), things like consistent hashing dictate a new place, but there aren't multiple places to write to.
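
The W+R>N condition can be checked by brute force. The sketch below is illustrative and not riak code; the function names are made up for the example. It enumerates every possible set of W replicas that acknowledged a write and every set of R replicas answering a read, and confirms that they always share at least one replica exactly when W+R>N.

```python
from itertools import combinations


def quorums_overlap(n: int, w: int, r: int) -> bool:
    """The arithmetic rule: read and write quorums must intersect."""
    return w + r > n


def every_read_sees_a_write(n: int, w: int, r: int) -> bool:
    """Brute-force check over all write sets and read sets.

    Returns True only if every possible read set of size r contains
    at least one replica from every possible write set of size w.
    Assumes 1 <= w <= n and 1 <= r <= n.
    """
    replicas = range(n)
    return all(
        set(write_set) & set(read_set)
        for write_set in combinations(replicas, w)
        for read_set in combinations(replicas, r)
    )


if __name__ == "__main__":
    for w, r in [(2, 2), (1, 1), (3, 1)]:
        print(f"N=3 W={w} R={r}:", every_read_sees_a_write(3, w, r))
```

With N=3, choosing W=2 and R=2 guarantees a read touches a replica that saw the latest write, while W=1 and R=1 does not. Overlap only guarantees the newest value is among the responses; picking it out of conflicting replies is still a separate step.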

Any time you accept concurrent writes in more than one place, you have to deal with conflict resolution. In some cases this means dealing with it at the application level.

I don't know it well, but it's my understanding that MemcacheDB is really just memcached with disk (BDB, IIRC) in place of memory on the back end. This has been done a few different times and in a few different ways. Topology changes are the killers here. Consistent hashing can't really help you deal with changes in this kind of deployment.
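
A small sketch of why topology changes hurt a disk-backed store behind client-side consistent hashing. Everything here is an assumption for illustration (the MD5-based ring, 100 points per node, the Ring and moved_keys names); it is not the hashing used by any particular memcached client.

```python
import bisect
import hashlib


def _hash(value: str) -> int:
    """Map a string to a position on the ring (MD5 is an arbitrary choice)."""
    return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)


class Ring:
    """A basic consistent-hash ring with several points per node."""

    def __init__(self, nodes, points=100):
        self._ring = sorted(
            (_hash(f"{node}#{i}"), node) for node in nodes for i in range(points)
        )
        self._hashes = [h for h, _ in self._ring]

    def node_for(self, key: str) -> str:
        # First ring point clockwise from the key's hash, wrapping around.
        index = bisect.bisect(self._hashes, _hash(key)) % len(self._ring)
        return self._ring[index][1]


def moved_keys(keys, before: Ring, after: Ring):
    """Keys whose owning node differs between two topologies."""
    return [k for k in keys if before.node_for(k) != after.node_for(k)]


if __name__ == "__main__":
    keys = [f"key{i}" for i in range(1000)]
    before = Ring(["a", "b", "c"])
    after = Ring(["a", "b", "c", "d"])
    moved = moved_keys(keys, before, after)
    print(f"{len(moved)} of {len(keys)} keys now map to a different node")
```

Consistent hashing keeps the number of remapped keys small, but it does not move any data. For a pure cache a remapped key is just a miss that gets refilled. For a persistent store, the values for those keys are still sitting on the old node's disk, and clients are now reading and writing them somewhere else.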


- Matt
