On Wed, 8 Mar 2000, Tony Wasserman wrote:

> 1) Making the change to
> REQFLOOR 1r
> didn't do the job.  The request report still doesn't find the .jsp pages.
> REQFLOOR -100r
> 
> Adding
> REQINCLUDE *
> didn't help either,  but Analog found *lots* of GIF files that were previously
> excluded by the
> REQINCLUDE pages
> command
> 

Well, if REQFLOOR 1r and REQINCLUDE * together don't find them, then you
must have excluded or aliased them somehow, or else those lines are corrupt.
I suggest you make a logfile with just half a dozen of the *.jsp requests
and that should enable you to identify what's going on.

> 2) The Keynote requests come from lots of different hosts.  The idea is 
> that they
> are testing the responsiveness of various web sites (particularly the most 
> popular
> ones) to provide statistics on the overall performance of the web for 
> commercial
> interests.  As a result, they originate their requests from lots of 
> different hosts
> and IP addresses.  The only way to exclude is on some kind of browser exclude.
> 

Well, I don't know what these ones have for their browser string. But from
your previous message, I gathered that it was indistinguishable from a
regular Mozilla.

In general, you can't identify all spiders. If they don't come from
a distinctive host, and don't have a distinctive browser name, there's
nothing to separate them from normal requests.

-- 
Stephen Turner               http://www.statslab.cam.ac.uk/~sret1/
  Statistical Laboratory, 16 Mill Lane, Cambridge CB2 1SB, England
"8th March 2000. National No Smoking Day. Ash Wednesday." (On a calendar)

------------------------------------------------------------------------
This is the analog-help mailing list. To unsubscribe from this
mailing list, send mail to [EMAIL PROTECTED]
with "unsubscribe" in the main BODY OF THE MESSAGE.
List archived at http://www.mail-archive.com/[email protected]/
------------------------------------------------------------------------

Reply via email to