I have been testing analog's accuracy for someone who is pretty demanding
about getting exactly the right answers (I keep telling him it's a
comparative science . . .). In general, when I count up by hand the same
values that analog reports, I get different answers than analog does. Of
course, I could be leaving out a lot of steps. I'm hoping someone can give
me some ideas about why my method is wrong.
For example, I first ran grep -v [ip-address] on a logfile to remove
requests from the company's firewall, which drops all requests from
internal sources, as they like to do. In analog, we achieve the same thing
with
HOSTEXCLUDE [ip-address]
Then, if I run wc -l on the output to count the total number of entries,
I invariably get a higher count than analog's total requests. My count is
about 5% to 10% higher.
I don't think corrupt log file entries explain it; those usually run 1%-2%
of the total requests (normally because of referrers with commas in them,
unfortunately).
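
The manual count described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the log is in common log format (host in field 1, status code in field 9), the sample lines are made up, and 203.0.113.7 is a hypothetical stand-in for the firewall address. It uses grep -vF so the dots in the address are matched literally; note that grep still matches the address anywhere on the line, not only in the host field. The per-status breakdown is an extra step not in the original method; it may help show whether the gap comes from requests that analog's total does not include, which is worth checking against analog's documentation rather than assuming.

```shell
# Hypothetical sample log; 203.0.113.7 stands in for the firewall address.
log=$(mktemp)
cat > "$log" <<'EOF'
203.0.113.7 - - [10/Oct/2000:13:55:36 -0700] "GET /index.html HTTP/1.0" 200 2326
198.51.100.4 - - [10/Oct/2000:13:55:40 -0700] "GET /index.html HTTP/1.0" 200 2326
198.51.100.4 - - [10/Oct/2000:13:55:41 -0700] "GET /missing.html HTTP/1.0" 404 209
198.51.100.9 - - [10/Oct/2000:13:56:02 -0700] "GET /old HTTP/1.0" 302 0
198.51.100.9 - - [10/Oct/2000:13:56:03 -0700] "GET /logo.gif HTTP/1.0" 304 0
EOF

firewall="203.0.113.7"

# Drop every line containing the firewall address (fixed-string match).
filtered=$(grep -vF "$firewall" "$log")

# Total number of remaining entries, as wc -l would report it.
total=$(printf '%s\n' "$filtered" | wc -l | tr -d ' ')

# Same entries broken down by status code (field 9 in common log format).
by_status=$(printf '%s\n' "$filtered" \
  | awk '{ n[$9]++ } END { for (s in n) print s, n[s] }' | sort)

echo "total entries: $total"
echo "$by_status"
rm -f "$log"
```

Comparing each status line against the corresponding figure in analog's report should narrow down where the two counts diverge.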
What else could be going on? Or--how should I be using manual methods to
verify analog's numbers?
Thanks,
Matt Morgan
------------------------------------------------------------------------
This is the analog-help mailing list.
------------------------------------------------------------------------