In my case, the Vary header settings are off.  Here are all the "vary" lines
in my records.config:

CONFIG proxy.config.http.cache.enable_default_vary_headers INT 0
CONFIG proxy.config.http.cache.vary_default_text STRING NULL
CONFIG proxy.config.http.cache.vary_default_images STRING NULL
CONFIG proxy.config.http.cache.vary_default_other STRING NULL

I have attached the entire file to this mail.  I am currently using version
3.2.4.  Which version did you use?  The question remains: how can we purge a
file if the original URL is not available?

Thanks for your time and help.

Réjean Bouchard
Nexweb


-----Original Message-----
From: Leif Hedstrom [mailto:[email protected]]
Sent: June 21, 2013 01:42
To: Réjean Bouchard
Cc: [email protected]
Subject: Re: Want to get the original URL

On 6/20/13 2:46 PM, Réjean Bouchard wrote:
> The reason why I'm looking for this is simple.  TS keeps multiple
> copies based on the inbound domain.  Here is a way to prove this concept:
>
> Create 2 domains, e.g. ts.mysite.com and ts2.mysite.com.
> Remap those domains to www.mysite.com
> Create test.txt file with the text "first file"
> Go to ts.mysite.com/test.txt  :  you will see "first file"
> Change test.txt content to "second file"
> Go to ts2.mysite.com/test.txt  :  you will see "second file"
> Clear browser cache


Well, that's not how it's designed to behave, and I cannot reproduce this
in my own tests. This is what I have in my remap.config:

map http://ts1.example.com  http://localhost:82
map http://ts2.example.com  http://localhost:82


I cleared the cache ("sudo traffic_server -Cclear"), and started it up:

$ curl -D - -H "Host: ts1.example.com" \
    -H "Cache-Control: only-if-cached" http://localhost/test.txt
HTTP/1.1 504 Not Cached

$ curl -D - -H "Host: ts2.example.com" \
    -H "Cache-Control: only-if-cached" http://localhost/test.txt
HTTP/1.1 504 Not Cached

Neither request gives a cache hit. Now I allow it to cache for the
ts1.example.com domain:

$ curl -D - -H "Host: ts1.example.com" http://localhost/test.txt
HTTP/1.1 200 OK


Then the same tests as above:

$ curl -D - -H "Host: ts1.example.com" \
    -H "Cache-Control: only-if-cached" http://localhost/test.txt
HTTP/1.1 200 OK

$ curl -D - -H "Host: ts2.example.com" \
    -H "Cache-Control: only-if-cached" http://localhost/test.txt
HTTP/1.1 200 OK


I can also verify that both URLs give the same response. The Age: headers
(a good indicator) are identical, and only one of the requests results in
an origin request.


I have no idea why you are not getting this behavior. What you are
experiencing is simply not how it works. A *wild* guess is that you are
maybe doing Vary: on some headers, and that causes it to create
different entries for various requests (which is as it should be).

-- Leif



> Change test.txt content to "third file"
> Go to ts.mysite.com/test.txt  :  you will see "first file"
> Go to ts2.mysite.com/test.txt  :  you will see "second file"
> There is only one entry in the cache if you scan it with a regex search.
>
> So, the reason why you want to be able to see the original request URL is
> to be able to flush all the versions of test.txt.
> Let's say that you have 15,000,000 cached images that are generated by
> users and you want to purge from the cache every file that has certain
> values in the URL (e.g. picture size 10X40).
> Flushing the complete cache for that purpose can be trivial.  On the other
> hand, having to generate a purge request for every image in the database
> is not the optimal way and can be a pain.
> Now, having the ability to purge from a regex can be the optimal and the
> best solution.
> I'm fixing the webUI for this purpose.  Since the system returns only the
> remapped URL, and it's not possible to purge a remapped URL, it's not very
> useful.  I tried the HTTPInfo->request_url_get() function, but it returns
> nothing, so I decided to ask here where the info was.
>
> So, what would you think if I fixed TS so that this information is
> available from the function?  Do you see a reason why not?
>
>
> Réjean Bouchard
> Nexweb
>
>
> -----Original Message-----
> From: Leif Hedstrom [mailto:[email protected]]
> Sent: June 20, 2013 10:42
> To: [email protected]
> Cc: Réjean Bouchard
> Subject: Re: Want to get the original URL
>
> On 6/20/13 6:49 AM, Réjean Bouchard wrote:
>> 4 - Finally, this is the same problem when we check the checkbox and
>> try to click on the "DELETE" button.
>>
>>
>>
>> So can anybody tell me where I can find those original URLs?
>
> Once in the cache, you cannot track it back to the "original URL" (I'm
> fairly certain at least). There's a simple reason for this: There are no
> guarantees of a 1-to-1 mapping. It's entirely possible, and sometimes
> likely, that 1,000 URLs can map to the same cache URL. Or 1,000,000
> URLs...
>
> If this is important to you, you can log both the pristine and remapped
> URL, and build up some sort of relationship in an external system.
>
> Cheers,
>
> -- Leif
>
>
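
The suggestion in the oldest message of this thread (log both the pristine
and the remapped URL, then build the relationship externally) could be
sketched roughly as follows. This is only an illustration under assumptions
that are not from the thread: it supposes a custom log whose lines contain
exactly two whitespace-separated fields, the pristine URL followed by the
remapped URL, and the function names build_url_index and
pristine_urls_matching are hypothetical helpers, not part of Traffic Server.

```python
import re
from collections import defaultdict


def build_url_index(log_lines):
    """Map each remapped (cache) URL to the set of pristine URLs seen for it.

    Assumed line format (hypothetical): "<pristine_url> <remapped_url>".
    """
    index = defaultdict(set)
    for line in log_lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        pristine, remapped = parts
        index[remapped].add(pristine)
    return index


def pristine_urls_matching(index, pattern):
    """Return the sorted pristine URLs whose remapped URL matches the regex."""
    regex = re.compile(pattern)
    found = set()
    for remapped, pristines in index.items():
        if regex.search(remapped):
            found.update(pristines)
    return sorted(found)


if __name__ == "__main__":
    sample = [
        "http://ts.mysite.com/test.txt http://www.mysite.com/test.txt",
        "http://ts2.mysite.com/test.txt http://www.mysite.com/test.txt",
        "http://ts.mysite.com/other.txt http://www.mysite.com/other.txt",
    ]
    idx = build_url_index(sample)
    # Candidate URLs to purge for everything cached as .../test.txt
    for url in pristine_urls_matching(idx, r"/test\.txt$"):
        print(url)
```

The resulting list of pristine URLs could then be fed to whatever purge
mechanism is in use. Since many pristine URLs may map to one cache URL, the
index keeps a set per remapped URL rather than a single value.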
