Re: [RFC] cache architecture

Pieter De Wit Tue, 24 Jan 2012 02:02:34 -0800

<snip>

Perhaps a 9) Implement dual IO queues - I *think* the IO has beenmoved into it's own thread, if not, the queuing can still be applied.Any form of checking the cache is going to effect squid, so how do weensure we are idle, dual queues :) Queue 1 holds the requests forsquid, queue 2 holds the admin/clean up requests. The IO "thread" (ifnot threaded), before handling an admin/clean up request checks Queue1 for requests, empties is *totally before* heading into Queue 2.This will allow you to have the same caching as now, relieving thestart-up problems ? Might lead to the same double cache of objects asabove (if you make the cache writable before the scan is done)
I wonder about priority queues every now and then. It is aninteresting idea. The I/O is currently done with pluggable modules forvarious forms. DiskThreads and AIO sort of do this but are FIFO queuedin N parallel queues. Prioritised queues could be an interestingadditional DiskIO module.

Hard to implement given the current "leg work" is already done ? Howwell does the current version of squid handle multicores and can thistake advantage of cores ?

What I'm looking for is a little bit more abstracted towards thearchitecture level across cache type and implementation. At that scalewe can't use any form of "totally empty" queue condition because oncaches that receive much traffic the queue would be quite full, maybenever actually empty. Several of the problems we have now are waitingon the cache load completed (ie the load action queue empty) before acache is even considered for use.
Amos

At that scale, no matter what you do, you will impact performance/your"wanted" outcome. It's about reaching an acceptable balance which Ithink, you, as a dev, will have a hard time predicting for any real lifeusage out there. Perhaps "we" (in " since I am yet to contrib a singleline of code :) ) can make it "Weighted Priority" and as such, havesquid.conf options to tune it. The Admin has to decide how aggresivesquid must be at rebuilding (makes me think of the raid rebuild optionsin HP RAID controllers) the cache. I am thinking of:


cache_rebuild_weight <0-"max int"> ?

For every x requests, action an "admin/clean up" request, unless "Queue1" is empty, then drain "Queue 2"


I am also thinking of a "third" queue, something like:

Queue 1 - Write requests (depends on cache state, but has the mostimpact - writes are slow)

Queue 2 - Read requests (as above, but less of an impact)
Queue 3 - Admin/Clean up

The only problem I have so far is Queue 1 is above Queue 2.....theymight be swapped since you are reading more than writing ? Perhapsanother config option.....

cache_dir /var/dir1 128G 128 128 Q1=read Q2=write (cache_dir syntaxwrong....)cache_dir /var/dir2 32G 128 128 Q1=write Q2=read (as above, but thismight be on ssd)


I think this might be going too far ?

Cheers,

Pieter

Re: [RFC] cache architecture

Reply via email to