aka "GET If-Modified-Since" (GET IMS))

Artur Bergman Mon, 27 Sep 2010 12:31:14 -0700

For persistant storage, just ignore the TTL and throw away the segmentwith the oldest object, refreshed or not.

I am of the opinion that if a method exists to verify the object, LMor Etag, we shouldn't ever expire it. The ttl is just a setting forwhen we should refresh it. Of course, standard LRU should still apply.

I am also less worried about the reader/writer scenario for theheaders, since by spec you shouldnt' update any headers that aren'tExpires/Cache-Control (and weirdly enough, Vary)


Artur

On Sep 27, 2010, at 6:50 AM, Nils Goroll wrote:

Hi,

I'd like to add a brief update to the following section summarizing my
understanding after talking to phk today, who seems to be reallybusy and
probably will not find time to respond before the weekend:
To allow multiple cache objects to share body data, we want to add
reference counters to struct storage following the example of the
existing implementation for objects (HSH_Ref(), HSH_Unref() etc).
Though I still believe this should be pretty straight forward forall otherstorages, it won't be for -spersistent. After studying the code foran hour or
so, my understanding is the following:

Persistent storage segments the cache (see
http://www.varnish-cache.org/trac/wiki/ArchitecturePersistentStorage) and won'tre-use segments for new objects unless they are completely empty (noliveobjects). Right now, this relies on the LRU and TTL based expiry toeventuallyclean out segments before running out of space. Having multiple refsto the sameobj in persistent storage (and updating it again and again) wouldeffectively
lead to more and more segments being kept from becoming empty.

I believe what is really needed is additional space management for the
persistent storage. In a first step, when running short of storage,objectscould get nuked from the smallest segment. In a second step, themechanics tocopy live objects from one segment to another could be implemented.Ideally,this could be vcl controlled ("should we rather nuke the object orbothercopying it?"). But I see some complications for both, mainly thatstorage wouldneed to know which objects are referencing it in order to updatethose (sounds
wrong).
As long as we don't have any of this, I suggest two alternativetemporary solutions:
a) If an object getting refreshed lives in persistent storage, we'llsimply copyit. Actually, the existing Rackspace implementation does this. Thisis far fromoptimal, but won't make much of a difference for small objects andis still muchmore efficient than re-fetching the object from backend like today,so we
shouldn't see any performance regression.

For other stevedores, we'll use the reference counter.
b) Add reference counters to persistent storage, too, and simplylive with thecache fragmentation issue. Those using persistent storage would beadvised not
to use cache refresh.

At this point, I'd favor a).
Please note that all of this is my personal understanding. I amposting thesethoughts in the hope that my understanding is correct and I'd reallyappreciate
corrections if it's not.

Thank you, Nils

_______________________________________________
varnish-dev mailing list
[email protected]
http://lists.varnish-cache.org/mailman/listinfo/varnish-dev



_______________________________________________
varnish-dev mailing list
[email protected]
http://lists.varnish-cache.org/mailman/listinfo/varnish-dev

Re: Proposal/specs for backend conditional requests / aka "GET If-Modified-Since" (GET IMS))

Reply via email to