In article <[EMAIL PROTECTED]> you write:
> On Wed, 25 Jul 2001, Ross Virgin wrote:
...

>   At the very least, a website opens two connections.
> The first to get the HEAD info, which you see in your
> cache with the .htp extension, and the second will be
> the html file.  More than likely it will open another,
> grab a graphics file, close that.  Open another, grab
> that graphics file, close, etc., until the entire page
> is grabbed.  If such data really were "spooled" you 
> couldn't make any sense of it anyway.

HEAD is hardly ever used and, on my website at least, never by Arachne.
99.5% of all access is by GET requests.
If all HTML files are cached to disk, they can obviously be reviewed 
offline and it's comparatively easy to track a site's access history if
either the files are saved with sequential local names or an index is
maintained (this latter may require some sort of index processor).
An additional trick I like is for each file to have it's URL added at 
the top of the cached file in a comment.

Alex.
-- 
   ____________________________    _______________________________
  (    Alex Venn               )  (   Success has many fathers,   )  
 (_)   [EMAIL PROTECTED]     (____)  but failure is an orphan.  (_)

Reply via email to