In article <[EMAIL PROTECTED]> you write:
> On Wed, 25 Jul 2001, Ross Virgin wrote:
...
> At the very least, a website opens two connections.
> The first to get the HEAD info, which you see in your
> cache with the .htp extension, and the second will be
> the html file. More than likely it will open another,
> grab a graphics file, close that. Open another, grab
> that graphics file, close, etc., until the entire page
> is grabbed. If such data really were "spooled" you
> couldn't make any sense of it anyway.
HEAD is hardly ever used and, on my website at least, never by Arachne.
99.5% of all access is by GET requests.
If all HTML files are cached to disk, they can obviously be reviewed
offline and it's comparatively easy to track a site's access history if
either the files are saved with sequential local names or an index is
maintained (this latter may require some sort of index processor).
An additional trick I like is for each file to have it's URL added at
the top of the cached file in a comment.
Alex.
--
____________________________ _______________________________
( Alex Venn ) ( Success has many fathers, )
(_) [EMAIL PROTECTED] (____) but failure is an orphan. (_)