Hard to implement, given the current "leg work" is already done? How well does the current version of Squid handle multiple cores, and can this take advantage of them?

Should be easy. We have not exactly checked and documented the DiskIO library API, but the current AIO handles SMP exactly as well as the system AIO library can; the same goes for the pthreads library behind DiskThreads.

Taken from iscsitarget: they have a "wthreads x" config option that spawns x threads for writes only, I believe; I am not sure about reads. You can't control this in the AIO library (I think?), but perhaps something like this could be useful for pthreads.

<snip>

Each cache_dir can report this up to the top layer via its loading factor while it is not servicing requests. I was considering using it to prioritise CLEAN rebuilds before DIRTY ones, or to order each cache_dir by the speed of its storage type and its loading factor.
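A rough sketch of that ordering (all names here are made up for illustration, none of this is existing Squid code):

```python
# Hypothetical sketch: order cache_dir rebuilds so CLEAN indexes come up
# before DIRTY ones, breaking ties by storage speed and current load.
# CacheDir, speed_rank and loading_factor are invented names, not Squid API.

from dataclasses import dataclass

@dataclass
class CacheDir:
    name: str
    clean: bool            # True if the swap log was closed cleanly
    speed_rank: int        # lower = faster storage (ssd=0, sata=1, ...)
    loading_factor: float  # 0.0 idle .. 1.0 saturated, reported upward

def rebuild_order(dirs):
    # CLEAN first ("not clean" makes False sort before True), then
    # faster storage, then the least-loaded dir.
    return sorted(dirs, key=lambda d: (not d.clean, d.speed_rank, d.loading_factor))

dirs = [
    CacheDir("sata", clean=False, speed_rank=1, loading_factor=0.2),
    CacheDir("ssd", clean=True, speed_rank=0, loading_factor=0.5),
]
print([d.name for d in rebuild_order(dirs)])  # ssd first: clean and faster
```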

It seems we are heading towards naming cache_[dir|mem], otherwise the options might become confusing? Almost the same as "cache_peer name=bla". (While writing the example below I came up with another idea, storage weights, which might solve "my issue" with the double object store.)

cache_dir name=sata /var/dir1 128G 128 128
cache_io_lib sata AIO
cache_rebuild_weight sata 1
cache_request_weight sata 100
cache_state sata readonly
cache_weight sata 100

cache_dir name=ssd /var/dir2 32G 128 128
cache_io_lib ssd pthreads
cache_rebuild_weight ssd 100
cache_request_weight ssd 1
cache_state ssd all
cache_weight ssd 10

cache_mem name=mem1 1G
cache_state mem1 all
cache_weight mem1 1

(I feel the memory one is "out of place", but perhaps someone else has another idea/thought process - why would you need two cache_mems?)

What I wanted to show above was the use of "name=" in cache_dir; that led to another idea, "cache_weight". So we are happy that the options are now settable per cache_dir :)

cache_weight will allow an admin to specify the "transit cost" of objects in a cache. Squid starts up and wants to serve objects as quickly as it can. Memory can be used for caching right away without issues. Now we start to initialise the disk caches. In my example above, the ssd cache should init before the sata one, giving us some storage space. During the init of the "sata" cache, the memory allocation fills up, so squid starts expiring objects to the next-cheapest cost: an object would travel from memory to "ssd" (much the same as it does now?).
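The transit-cost chain could look something like this (a pure sketch with invented names; the weights mirror the example config above, mem1=1, ssd=10, sata=100):

```python
# Hypothetical sketch of the cache_weight "transit cost" chain: when a
# store fills up, its evicted object moves to the next-costlier store
# that is ready (init done); if none is ready, the object is retired.
# None of this is existing Squid code.

class Store:
    def __init__(self, name, weight, capacity, ready=True):
        self.name, self.weight, self.capacity = name, weight, capacity
        self.ready = ready
        self.objects = []

    def add(self, obj, chain):
        self.objects.append(obj)
        if len(self.objects) > self.capacity:
            victim = self.objects.pop(0)          # oldest first, for brevity
            nxt = next((s for s in chain
                        if s.ready and s.weight > self.weight), None)
            if nxt:
                nxt.add(victim, chain)            # travels down the cost chain
            # else: the object is simply retired

mem1 = Store("mem1", 1, capacity=2)
ssd  = Store("ssd", 10, capacity=2)
sata = Store("sata", 100, capacity=4, ready=False)  # still rebuilding
chain = sorted([mem1, ssd, sata], key=lambda s: s.weight)

for obj in "abcde":
    mem1.add(obj, chain)
print(mem1.objects, ssd.objects, sata.objects)
# → ['d', 'e'] ['b', 'c'] []  (sata not ready, so ssd's overflow retired)
```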

Now, "sata" is still busy with its init, but "ssd" has also filled up, so we are forced to retire an object in "ssd", much like we do now. Once "sata" is done, it joins the queue, so objects will expire like:

mem1->ssd->sata (ignoring the fact that it's set to read-only for now)

If we have an object already in the "sata" cache that is also new in "ssd", we would expire that object as soon as the "sata" cache is done setting up. We do, however, now have the overhead of reading an object, writing it somewhere else (please, please, please admins - make it other spindles!!! :) ), freeing the original space, and then writing the new object.

Another example:

"sata" init'ed before "ssd":

Before init: mem1->sata
After init: mem1->ssd->sata

Now we could have the problem of "sata" and "ssd" holding the same object. We would expire the higher-cost copy (the one in "sata"), since the object is needed more than we "thought"? This is the *only* way objects can travel "up" the disk cost chain; otherwise we could be throwing objects between caches all day long.
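That duplicate rule in a few lines (again just a sketch, not Squid code; store layout and names are made up):

```python
# Hypothetical sketch of the duplicate rule: if the same object sits in
# both "ssd" and "sata" after "sata" finishes its rebuild, drop the copy
# in the higher-cost (higher cache_weight) store, since the object has
# earned its way back up the chain.

def resolve_duplicate(key, stores):
    holders = [s for s in stores if key in s["objects"]]
    if len(holders) > 1:
        costliest = max(holders, key=lambda s: s["weight"])
        costliest["objects"].remove(key)   # only way objects move "up"

ssd  = {"name": "ssd", "weight": 10, "objects": {"obj1"}}
sata = {"name": "sata", "weight": 100, "objects": {"obj1", "obj2"}}
resolve_duplicate("obj1", [ssd, sata])
print(sata["objects"])  # {'obj2'} - the sata copy was expired
```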

Let's stop there while I am ahead :)



For every x requests, action an "admin/clean up" request, unless "Queue 1" is empty, then drain "Queue 2"

I am also thinking of a "third" queue, something like:

Queue 1 - Write requests (depends on cache state, but has the most impact - writes are slow)
Queue 2 - Read requests (as above, but less of an impact)
Queue 3 - Admin/Clean up
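Putting the three queues and the "every x requests" rule together, something like this (purely illustrative; ADMIN_INTERVAL and the queue names are invented):

```python
# Hypothetical sketch of the per-cache_dir queues: writes, reads, and an
# admin/clean-up queue that gets one slot every ADMIN_INTERVAL requests,
# or whenever both other queues are empty. Which queue is Q1 vs Q2 per
# cache_dir would come from the proposed config option.

from collections import deque

ADMIN_INTERVAL = 4  # "for every x requests" - illustrative value

def drain(q1, q2, q3):
    served, since_admin = [], 0
    while q1 or q2 or q3:
        if q3 and (since_admin >= ADMIN_INTERVAL or not (q1 or q2)):
            served.append(q3.popleft()); since_admin = 0
        elif q1:
            served.append(q1.popleft()); since_admin += 1
        elif q2:
            served.append(q2.popleft()); since_admin += 1
    return served

writes = deque(["w1", "w2", "w3", "w4", "w5"])
reads  = deque(["r1", "r2"])
admin  = deque(["a1", "a2"])
print(drain(writes, reads, admin))
# → ['w1', 'w2', 'w3', 'w4', 'a1', 'w5', 'r1', 'r2', 'a2']
```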

The only problem I have so far is that Queue 1 sits above Queue 2... they might need to be swapped, since you are reading more than writing? Perhaps another config option...

cache_dir /var/dir1 128G 128 128 Q1=read Q2=write (cache_dir syntax wrong....)
cache_dir /var/dir2 32G 128 128 Q1=write Q2=read (as above, but this might be on ssd)

I think this might be going too far ?

Cheers,

Pieter


No comments on the three queues per cache "space"?