I have been testing analog's accuracy for someone who is pretty demanding 
about getting exactly the right answers (I keep telling him it's a 
comparative science . . .).  In general, when I use manual methods to 
count the same values that analog reports, I get different answers than 
analog does.  Of course, I could be leaving out a lot of steps.  I'm hoping 
someone can give me some ideas about why my method is wrong.

For example, I first ran a grep -v [ip-address] on a logfile to remove 
requests from the company's firewall, since they like to ignore all 
requests from internal sources.  In analog, we achieve the same thing with

HOSTEXCLUDE [ip-address]

Then, if I run a wc -l on the output to count the total number of entries, 
I invariably get a higher count than analog's total requests.  It varies 
from about 5% to 10% higher.
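
As a rough sketch, the manual count described above could be run like 
this.  The address 192.0.2.1 and the sample log lines are placeholders 
invented for illustration, not values from the real logfile, and -F is 
an addition so that the dots in the address are matched literally.

```shell
# Build a tiny sample log (invented data; 192.0.2.1 plays the firewall).
log=$(mktemp)
cat > "$log" <<'EOF'
192.0.2.1 - - [10/Oct/2000:13:55:36 -0700] "GET /index.html HTTP/1.0" 200 2326
198.51.100.7 - - [10/Oct/2000:13:56:01 -0700] "GET /a.html HTTP/1.0" 200 512
198.51.100.8 - - [10/Oct/2000:13:57:12 -0700] "GET /b.gif HTTP/1.0" 304 0
EOF

# -v keeps the lines that do NOT contain the address;
# -F treats it as a fixed string, so "." is not a wildcard.
total=$(grep -v -F '192.0.2.1' "$log" | wc -l)
total=$((total))   # drop the padding some wc implementations add

echo "entries left after exclusion: $total"
rm -f "$log"
```

Note that grep matches the address anywhere on the line, not only in the 
host field.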

I don't think corrupted logfile entries explain it; those usually account 
for only 1%-2% of the total requests (normally because of referrers with 
commas in them, unfortunately).

What else could be going on?  Or--how should I be using manual methods to 
verify analog's numbers?

Thanks,
Matt Morgan

------------------------------------------------------------------------
This is the analog-help mailing list.
List archived at http://www.mail-archive.com/[email protected]/
------------------------------------------------------------------------