[HACKERS] Webcluster session storage, was vacuum, performance, and MVCC

James Robinson Fri, 23 Jun 2006 07:57:42 -0700

Verging on offtopic, but ...

Regarding the best place for session data, memcached isn't really the answer, for the opposite reasons to why it isn't so great to store it in the central DB for a big web farm.

Folks on the memcached lists propose this [ "I keep all my session data in memcached, it works great!" ] from time to time, and always get smacked down with "Don't do that -- memcached is just for caching stuff that is retrievable from some stable / slower source. Memcached's LRU policy could conceivably evict that session data at any time and you'll lose."

So, it works fine under low / moderate load [ just as it does in PG when periodic vacuuming can keep up ], but under higher user volume than anticipated your memcaches could well decide to evict otherwise good sessions. Especially if all your session data objects are about the same size, but you also store things either smaller or bigger in memcache -- its current slab allocator subdivides total heap space into slabs for like-sized objects on powers of two I believe, so even though you gave memcache a half a gig or whatever to play with, the available size for 1K objects will not be that half a gig if you're also storing 20 byte and 10K values.
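
A rough sketch of that slab arithmetic, in Python. It assumes power-of-two size classes as guessed above, a 1 MB page handed to a class on first need, and pages that are never reassigned between classes; the names `size_class`, `assign_pages` and `PAGE_SIZE` are made up for illustration and are not memcached's API, and the real allocator may well differ.

```python
from collections import Counter

PAGE_SIZE = 1024 * 1024  # assumed size of one page handed to a size class


def size_class(nbytes, smallest=32):
    """Smallest power-of-two class (hypothetical scheme) that fits nbytes."""
    cls = smallest
    while cls < nbytes:
        cls *= 2
    return cls


def assign_pages(object_sizes, total_pages):
    """Hand pages to size classes first come, first served.

    Returns a dict mapping size class -> number of pages it ended up owning.
    """
    pages = Counter()       # size class -> pages owned
    free_slots = Counter()  # size class -> unused slots in owned pages
    remaining = total_pages
    for nbytes in object_sizes:
        cls = size_class(nbytes)
        if free_slots[cls] == 0:
            if remaining == 0:
                # No page left for this class; a real cache would have to
                # evict something already stored in this class instead.
                continue
            remaining -= 1
            pages[cls] += 1
            free_slots[cls] += PAGE_SIZE // cls
        free_slots[cls] -= 1
    return dict(pages)


if __name__ == "__main__":
    # Small and large values arrive first and claim every page.
    workload = [20] * 40000 + [10 * 1024] * 100 + [1024] * 5000
    print(assign_pages(workload, total_pages=4))
```

In this toy run the 20 byte and 10K values take all four pages before any 1K session object shows up, so the 1K class gets nothing even though the heap is far from full of useful data.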

Therefore I would not recommend memcached as the sole container for session data. It's a well designed champ as a fast cache, but it oughta be used just as a cache, not as a datastore. Hence the name.

For folks who say that the stuff should be in RAM at the appserver side -- well -- that does imply sticky session load balancing, which I believe most folks agree ought to be avoided if at all possible. And in our specific case, our appserver code is fastcgi / multiprocess on each webserver, so there's no central RAM to store it in, save for swallowing the shared memory bullet. Some sort of distributed shared memory [ which seems to be rather how Mohawksoft's MCache is designed ] probably also performs best with sticky sessions -- otherwise the session pages thrash between appservers on writes. I've not read the docs with a fine-tooth comb to really infer how it is designed, so please forgive and educate me if MCache has an elegant solution for write-heavy non-sticky session use.

I'm squarely in the camp of just suck it up and live with serialization -- storing session data in RAM on appservers in appserver processes really really binds your hands from a deployment and administration angle. And you don't even really realize it until you've tasted the freedom of having it be outside. It lets you, oh, say, do code updates in the middle of the day -- not at 5AM. It lets you cluster horizontally in a much simpler manner -- no needing to deal with any broadcast-writes voodoo. It lets you not have to have sticky sessions. It lets you be much more, well, stateless, in your appserver code. Our site used to be single-JVM JBoss-based with sessions in RAM, and after moving away to a multiprocess fastcgi model -- the grass is way greener on this side.

In rewriting we're also going completely stateless -- no illusion of a scratch session data store [ we're so read-mostly that it is easily possible ]. But if we 'had' to have session support, I'd like to use a system which worked something like:

1) Deployed centrally parallel to memcached and DB servers for the webcluster.

2) Performs update-in-place.

3) Will not arbitrarily evict data.

4) Stores arbitrary volumes of info -- i.e. spills to disk if it has to, but doesn't have to worry about fsyncing all day long -- be they tx logs or datafiles or what have you.

5) Fast as hell for reading / writing 'hot' session data. Does not have to worry about fast concurrent access to session data -- we're probably handling at most one single HTTP request per this session at a time.

6) Clusterable in memcached model -- client-side code hashes the session key to the backend session store. If said backend session server becomes unresponsive, flag him as temporarily dead [ try again in 30 seconds? ] and remove from possible candidates. Rehash and re-query.

7) Migratable: If I need to bring down session server N, it stops answering requests and re-hashes what it has stored and transmits to its live peers. If hashing algorithm and list of servers were common across all servers + clients, and if perhaps a server response for a fetch / store could contain a "redirect this to other server" type response, then perhaps as the server going down streams the data to its peers and they ack reception, then this could possibly work w/o loss of data and would minimize delay at topology change time.
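
Wishlist item 6 could look roughly like the following on the client side. This is only a sketch under assumptions: `SessionRouter`, `do_fetch` and the plain hash-modulo placement are invented for illustration, a failed server is signalled by `ConnectionError`, and the 30 second retry is the figure floated above. It does not attempt the migration / redirect part of item 7.

```python
import hashlib
import time


class SessionRouter:
    """Client-side key -> server mapping with temporary dead-flagging."""

    def __init__(self, servers, retry_after=30.0, clock=time.monotonic):
        self.servers = list(servers)
        self.retry_after = retry_after  # seconds before a dead server is retried
        self.clock = clock
        self.dead_until = {}            # server -> time it may be tried again

    def mark_dead(self, server):
        self.dead_until[server] = self.clock() + self.retry_after

    def live_servers(self):
        now = self.clock()
        return [s for s in self.servers
                if self.dead_until.get(s, float("-inf")) <= now]

    def pick(self, session_key):
        """Hash the session key onto the current list of live servers."""
        live = self.live_servers()
        if not live:
            raise RuntimeError("no session servers available")
        digest = hashlib.sha1(session_key.encode("utf-8")).digest()
        return live[int.from_bytes(digest[:8], "big") % len(live)]

    def fetch(self, session_key, do_fetch):
        """Call do_fetch(server, key); on failure flag the server, rehash, retry."""
        while True:
            server = self.pick(session_key)
            try:
                return do_fetch(server, session_key)
            except ConnectionError:
                self.mark_dead(server)
```

Note that plain modulo hashing remaps many unrelated keys whenever the live list changes; a consistent-hashing scheme would move fewer sessions, at the cost of more client code.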

I could live with 'if one of my session servers croaks unplanned, I lose those sessions'. Wishlist items 6 and 7 remind me more and more of AFS or NFS4 client / server interaction. Not trivial, unfortunately.

I hate to say it, but would mysql / myisam not work well for points 1-5? I have no idea about it and 'fast as hell' -- never run it in production for anything. 6 + 7 could possibly be done atop mysql using a 3-tier model.


----
James Robinson
Socialserve.com

