On Nov 8, 2007, at 2:35, Tomash Brechko wrote:

        I fail to see what I'm missing.  As far as I can tell, you're
describing what I already do.  See my write up on client optimization
and let me know what I'm missing.

http://bleu.west.spy.net/~dustin/projects/memcached/optimization.html

According to this page, when several threads issue [a], [b], [a, b,
c], [a], [d], you combine these requests into one: [a, b, c, d].
Alright, suppose you get the reply [c, d].  How do you know what to
reply to each thread?  You have to compare keys on the client side to
know which results you got.  But if the reply were [nil, nil, c, d],
you'd only have to compare positions: the first goes to t1, t3, and t4,
the second to t2 and t3, and so on.
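The positional scheme proposed above could be sketched roughly like this (a hypothetical illustration, with misses modeled as None and made-up thread names t1..t5):

```python
# Hypothetical sketch of the positional-reply scheme described above.
# Five threads request [a], [b], [a, b, c], [a], [d]; the client coalesces
# these into one request for [a, b, c, d].  If the server echoed a slot for
# every key (nil for misses), replies could be routed by position alone,
# with no key comparisons.

coalesced_keys = ["a", "b", "c", "d"]
# Which waiting threads care about each position (t3 asked for a, b and c):
waiters_by_position = [["t1", "t3", "t4"], ["t2", "t3"], ["t3"], ["t5"]]

reply = [None, None, "value-c", "value-d"]  # server's positional reply

results = {}  # thread -> {key: value}
for pos, value in enumerate(reply):
    key = coalesced_keys[pos]
    for thread in waiters_by_position[pos]:
        results.setdefault(thread, {})[key] = value

print(results["t1"])  # {'a': None}
print(results["t3"])  # {'a': None, 'b': None, 'c': 'value-c'}
```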

In the binary protocol, you only use numbers (keys aren't returned). In the text protocol, you do string compares on the results. In both cases I use a hash table. If it ever bubbles up in my profiler, I might try to make it more efficient, but in the meantime, it doesn't seem to matter much.
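The hash-table dispatch described here could be sketched as follows (a simplified illustration with hypothetical names, not the actual internals of any client): hits are matched to waiters by key, and anything still unmatched when the end marker arrives is a miss.

```python
# Hypothetical sketch of key-based dispatch: hits carry their keys, so the
# client looks each one up in a hash table of pending requests; the end
# marker resolves every remaining key as a miss.

pending = {  # key -> list of threads still waiting on that key
    "a": ["t1", "t3", "t4"],
    "b": ["t2", "t3"],
    "c": ["t3"],
    "d": ["t5"],
}
results = {}  # thread -> {key: value}

def on_hit(key, value):
    # One hash lookup routes the value to every waiter for this key.
    for thread in pending.pop(key, []):
        results.setdefault(thread, {})[key] = value

def on_end():
    # Only hits were returned; everything left in the table is a miss.
    for key, threads in pending.items():
        for thread in threads:
            results.setdefault(thread, {})[key] = None
    pending.clear()

on_hit("c", "value-c")
on_hit("d", "value-d")
on_end()
```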

What matters is _overall_ throughput, not server performance alone.
If you optimize the server at the cost of additional work on every
client, that's not a win.

If you optimize the server at the cost of additional work on the clients, you're optimizing the more centralized resource at the cost of the less centralized resource. The server absolutely has to be the most optimal part of the whole thing for this very reason.

That's not to say the optimization of other components should be ignored, but I haven't found the existing protocol limiting my ability to optimize things.

        In the text protocol, a get with several keys only returns hits and
an end marker.  The idea is that if you're issuing that request,
you're probably going to return some kind of dictionary structure
to the caller.
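Concretely, a text-protocol multi-get returns only VALUE lines for hits, terminated by END; misses are inferred by absence. A small parser sketch (simplified: it ignores flags, assumes the whole response is already in hand, and uses made-up keys):

```python
# Simplified parse of a text-protocol multi-get response.  Only hits come
# back as "VALUE <key> <flags> <bytes>" lines; END terminates the reply,
# and any requested key without a VALUE line is a miss.

requested = ["a", "b", "c"]
response = b"VALUE c 0 7\r\nvalue-c\r\nEND\r\n"

hits = {}
lines = response.split(b"\r\n")
i = 0
while i < len(lines):
    if lines[i] == b"END":
        break
    if lines[i].startswith(b"VALUE "):
        _, key, flags, nbytes = lines[i].split(b" ")[:4]
        hits[key.decode()] = lines[i + 1][: int(nbytes)]
        i += 2  # skip the data line we just consumed
    else:
        i += 1

misses = [k for k in requested if k not in hits]
print(hits)    # {'c': b'value-c'}
print(misses)  # ['a', 'b']
```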

This "probably" comes from nowhere, and is a bad assumption for a
generic design.  The client might not need the dictionary, yet
currently it is forced to build one.

Does any client out there do a multi-get for a series of keys and not return values mapped to those keys?

        Ah, well in the general case, there's no processing to do for not
found keys.

As follows from your page, t1, which requested [a], has to wait
until [d] is processed, even though it could continue as soon as
[nil] for [a] was returned.

On the page you should also describe the drawback of the single
I/O-thread approach: threads effectively block each other.  For
instance, a high-priority t5 asking for a small value [d] would be
blocked by a low-priority t1 asking for a large value [a].


I don't see that as a drawback. If you actually had different priorities for different request types and determined that such blocking was causing a latency problem, you could just use different client instances and it'd go away immediately. Alternatively, if anyone cared, I could implement a priority concept for requests.

It had been suggested that I allow multiple connections per destination per client to reduce latency. I experimented with a branch doing that, but I wasn't able to measure a difference (I think you need more computers than I have available for such a test).

Whichever way you look at it, it's only a drawback of the single-I/O-thread approach if you can demonstrate that my I/O thread is somehow limiting the throughput.

It seems like memcached has historically been single-threaded for most installations, and as far as I can tell, when it is multithreaded, it's not because one thread can't keep the I/O buffers full.

--
Dustin Sallings
