In my case, the cache_vary_headers is off. Here is all "vary" lines in my records.config :
CONFIG proxy.config.http.cache.enable_default_vary_headers INT 0 CONFIG proxy.config.http.cache.vary_default_text STRING NULL CONFIG proxy.config.http.cache.vary_default_images STRING NULL CONFIG proxy.config.http.cache.vary_default_other STRING NULL I attach the entire file with this mail. I currently use version 3.2.4. What version did you use? The question is: how can we purge a file if the original URL is not available? Thanks for your time and help. Réjean Bouchard Nexweb -----Message d'origine----- De : Leif Hedstrom [mailto:[email protected]] Envoyé : 21 juin 2013 01:42 À : Réjean Bouchard Cc : [email protected] Objet : Re: Want to get the original URL On 6/20/13 2:46 PM, Réjean Bouchard wrote: > The reason why I'm looking for this is simple. The TS keep multiple > copies based on the inbound domain. Here is a way to prouve this concept: > > Create 2 domain ex: ts.mysite.com and ts2.mysite.com. > Remap those domains to www.mysite.com > Create test.txt file with the text "first file" > Go to ts.mysite.com/test.txt : you will see "first file" > Change test.txt content to "second file" > Go to ts2.mysite.com/test.txt : you will see "second file" > Clear browser cache Well, that's not how it's designed to behave, and I can not reproduce this in my own tests. This is what I have in my remap.config map http://ts1.example.com http://localhost:82 map http://ts2.example.com http://localhost:82 I cleared the cache ("sudo traffic_server -Cclear"), and started it up: $ curl -D - -H "Host: ts1.example.com" -H "Cache-Control: only-if-cached" http://localhost/test.txt HTTP/1.1 504 Not Cached $ curl -D - -H "Host: ts2.example.com" -H "Cache-Control: only-if-cached" http://localhost/test.txt HTTP/1.1 504 Not Cached Neither requests gives a cache hit. Now I allow it to cache for the ts1.example.com domain: $ curl -D - -H "Host: ts1.example.com" http://localhost/test.txt HTTP/1.1 200 OK Then same tests as above: $ curl -D - -H "Host: ts1.example.com" -H "Cache-Control: only-if-cached" http://localhost/test.txt HTTP/1.1 200 OK $ curl -D - -H "Host: ts2.example.com" -H "Cache-Control: only-if-cached" http://localhost/test.txt HTTP/1.1 200 OK I can also verify that both URLs gives the same response. And the Age: header (a good indicator) are identical, and I do not see an origin request for more than one request. I have no idea why you are not getting this behavior. What you are experiencing is simply not how it works. A *wild* guess is that you are maybe doing Vary: on some headers, and that causes it to create different entries for various requests (which is as it should). -- Leif > Change test.txt content to "third file" > Go to ts.mysite.com/test.txt : you will see "first file" > Go to ts2.mysite.com/test.txt : you will see "second file" > There is only one entry in cache if you scan it from regex search > > So, the reason why you want to be able to see the original URL request is to > be able to flush all the version of test.txt. > Let say that you have a 15,000,000 images cached that is generated by users > and you want to purge the cache of every file that have some values in the > URL (ex: picture size 10X40). > Flushing the complete cache for that purpose can be trivial. In the other > hand, having to generate a purge request for every image in the database is > not the optimal way and can be a pain. > Now, having the ability to purge from a regex can be the optimal and the > best solution. > I'm fixing the webUI for this purpose. And since the system return only the > remapped URL and it's not possible to purge a remapped URL, it's not very > usefull. I try the HTTPInfo->request_url_get() function return nothing, I > decided to ask here where the info was. > > So, what would think if I fix the TS so this information may be available by > the function? Do you see a reason why not? > > > Réjean Bouchard > Nexweb > > > -----Message d'origine----- > De : Leif Hedstrom [mailto:[email protected]] > Envoyé : 20 juin 2013 10:42 > À : [email protected] > Cc : Réjean Bouchard > Objet : Re: Want to get the original URL > > On 6/20/13 6:49 AM, Réjean Bouchard wrote: >> 4 - Finally, this is the same problem when we check the checkbox and >> try to click on the "DELETE" button. >> >> >> >> So does anybody tell me where i can find those originals URL? > > Once in the cache, you can not track it back to the "original URL" (I'm > fairly certain at least). There's a simple reason for this: There are no > guarantees of a 1-to-1 mapping. It's entirely possible, and sometimes > likely, that 1,000 URLs can map to the same cache URL. Or 1,000,000 million > URLs... > > If this is important to you, you can log both the pristine and remapped URL, > and build up some sort of relationship in an external system. > > Cheers, > > -- Leif > >
