Has anyone compared it to running as many separate non-threaded instances as you have processors?
That's how it was done originally. The idea of threading it is to get better memory efficiency and more efficient multi-get requests. For example: a box has 64G of RAM, but a single-threaded instance will peg one CPU before using all of it. If you split that into several instances instead and your code does tons of mgets, it sucks a bit to poll multiple memcached instances for the data.
-Dormando
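
To illustrate the multi-get point, here is a rough sketch of how one mget fans out when keys are spread over several instances. It does not talk to memcached at all; the crc32-modulo sharding and the function name `group_keys_by_instance` are assumptions made up for this example, and real clients may distribute keys differently.

```python
import zlib
from collections import defaultdict


def group_keys_by_instance(keys, num_instances):
    """Group keys by the instance that would hold them.

    Uses crc32 modulo num_instances as a stand-in sharding scheme
    (an assumption for illustration, not any particular client's logic).
    """
    groups = defaultdict(list)
    for key in keys:
        idx = zlib.crc32(key.encode("utf-8")) % num_instances
        groups[idx].append(key)
    return dict(groups)


keys = [f"user:{i}" for i in range(20)]

# One threaded instance: every key lands in the same place, so one request.
single = group_keys_by_instance(keys, 1)

# Eight separate instances: the same mget is split into up to eight requests.
many = group_keys_by_instance(keys, 8)

print("requests with 1 instance:", len(single))
print("requests with 8 instances:", len(many))
```

Each group is a separate request the client has to send and wait on, which is the polling overhead described above.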
