Aha. Our greps were not wrong after all; I finally found a
configuration that counts the article filetype correctly...

(details)
with these 5 lines from my sample log:
216.88.155.170 - - [15/Aug/1999:23:58:29 -0500] "GET
/stOnLine/cgi-bin/article?thisSlug=VIKE0816 HTTP/1.0" 200 10575
216.88.155.170 - - [15/Aug/1999:23:58:37 -0500] "GET
/stonline/graphics/article/related_items.gif HTTP/1.0" 200 327
207.205.141.42 - - [15/Aug/1999:23:58:30 -0500] "GET
/cgi-bin/stOnLine/article?thisStory=15019645 HTTP/1.1" 200 7430
209.240.200.146 - - [15/Aug/1999:23:58:21 -0500] "GET
/stOnLine/cgi-bin/article?thisSlug=monlet081699 HTTP/1.0" 200 10310
216.88.155.170 - - [15/Aug/1999:23:58:34 -0500] "GET
/stonline/graphics/article/sports_return.gif HTTP/1.0" 200 1434

and this configuration for pages:
PAGEINCLUDE /cgi-bin/stOnLine/article*,/stOnLine/cgi-bin/article*

we get 5 successful requests, 3 of which are successful requests for
pages, which is what we want. The other page configurations I tried did
not produce the correct results, and putting the question mark in the
config line (as in *article?*) does not help. If you have any other
directories besides /stOnLine/cgi-bin/ or /cgi-bin/stOnLine/ for the
article filetype, please let me know.
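
For anyone who wants to cross-check the expected count outside Analog,
here is a rough Python sketch using the five sample lines and the two
PAGEINCLUDE patterns above. It assumes plain shell-style, case-sensitive
wildcard matching (Python's fnmatchcase), which only approximates
whatever matching Analog does internally; the helper names
requested_path and is_page are made up for this illustration.

```python
from fnmatch import fnmatchcase

# The five sample log lines from above, with each wrapped request rejoined.
LOG_LINES = [
    '216.88.155.170 - - [15/Aug/1999:23:58:29 -0500] '
    '"GET /stOnLine/cgi-bin/article?thisSlug=VIKE0816 HTTP/1.0" 200 10575',
    '216.88.155.170 - - [15/Aug/1999:23:58:37 -0500] '
    '"GET /stonline/graphics/article/related_items.gif HTTP/1.0" 200 327',
    '207.205.141.42 - - [15/Aug/1999:23:58:30 -0500] '
    '"GET /cgi-bin/stOnLine/article?thisStory=15019645 HTTP/1.1" 200 7430',
    '209.240.200.146 - - [15/Aug/1999:23:58:21 -0500] '
    '"GET /stOnLine/cgi-bin/article?thisSlug=monlet081699 HTTP/1.0" 200 10310',
    '216.88.155.170 - - [15/Aug/1999:23:58:34 -0500] '
    '"GET /stonline/graphics/article/sports_return.gif HTTP/1.0" 200 1434',
]

# The two patterns from the PAGEINCLUDE line above.
PAGE_PATTERNS = ["/cgi-bin/stOnLine/article*", "/stOnLine/cgi-bin/article*"]


def requested_path(line):
    """Return the requested path (including any ?args) from a log line."""
    request = line.split('"')[1]   # e.g. 'GET /path?args HTTP/1.0'
    return request.split()[1]


def is_page(path):
    """True if the path matches any pattern (assumed shell-style wildcards)."""
    return any(fnmatchcase(path, pattern) for pattern in PAGE_PATTERNS)


paths = [requested_path(line) for line in LOG_LINES]
page_count = sum(is_page(path) for path in paths)
print(len(paths), "requests,", page_count, "pages")
```

The two .gif requests live under the lowercase /stonline/graphics/article/
directory, so neither pattern picks them up; only the three cgi-bin
article requests count as pages.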

I am now going to run similar tests on the folder filetype, too.

--kong





ikong wrote:
> 
> Jim,
> 
> I will run a test with just a few lines from one of your logs to see if
> Stephen Turner might be right.
> 
> Kong
> 
> -------- Original Message --------
> Date: Mon, 16 Aug 1999 15:15:53 +0100 (GMT)
> From: Stephen Turner <[EMAIL PROTECTED]>
> To: analog-help <[EMAIL PROTECTED]>
> Subject: Re: [analog-help] Analog pageview configuration
> 
> ikong wrote:
> 
> > We have read the Analog FAQs but still have a couple of questions about
> > configuring Analog. Specifically, we have been attempting to include as
> > page views certain file types that do not follow the conventional *.*
> > pattern. Here are a few lines from our recent logs that show the 2 file
> > types we would like to count as pages:
> >
> >    209.166.166.119 - - [08/Aug/1999:23:58:48 -0500] "GET
> >    /stOnLine/cgi-bin/article?thisStory=80826316 HTTP/1.0" 200 2853
> >    216.228.36.111 - - [08/Aug/1999:23:58:49 -0500] "GET
> >    /stOnLine/cgi-bin/article?thisSlug=und09 HTTP/1.1" 200 12961
> >    212.211.68.13 - - [08/Aug/1999:23:59:15 -0500] "POST
> >    /cgi-bin/stOnLine/folder HTTP/1.1" 200 9447
> >
> > We have been experimenting with counting these files as pages with either:
> >
> >     ARGSINCLUDE /stOnLine/cgi-bin/article,/stOnLine/cgi-bin/folder,/cgi-bin/stOnLine/folder,/cgi-bin/stOnLine/folder
> >     PAGEINCLUDE /cgi-bin/stOnLine/article*,/stOnLine/cgi-bin/article*,/stOnLine/cgi-bin/folder*,/cgi-bin/stOnLine/folder*,*.shtml,*.SHTML,*.HTML,*.HTM,*.cgi,*.txt,*.swf,*.qt,*.msgread,*.ram,*.mov,*.rpm,*.wav,*.msginput,*.hperl,*.au,*.avi
> >
> > or
> >
> >    PAGEINCLUDE *article?*,*folder,*.shtml,*.SHTML,*.HTML,*.HTM,*.cgi,*.txt,*.swf,*.qt,*.msgread,*.ram,*.mov,*.rpm,*.wav,*.msginput,*.hperl,*.au,*.avi
> >
> > as directives in the Analog configuration files.
> >
> > However, neither method has produced pageview counts consistent with
> > what we expected when adding counts from greps for the relevant lines
> > to our original pageview count (without the article and folder parts).
> 
> Well,
>   PAGEINCLUDE /stOnLine/cgi-bin/article*
> will certainly include the first two lines above as pages. (You can try it
> using a logfile containing just those two lines to check.) So maybe it's
> your grep count which is wrong?
> 
> --
> Stephen Turner    [EMAIL PROTECTED]
> http://www.statslab.cam.ac.uk/~sret1/
>   Statistical Laboratory, 16 Mill Lane, Cambridge CB2 1SB, England
>   "Due to the conflict in Kosovo, we will not be showing the movie Wag the
>    Dog. Instead, we will show Mortal Kombat: Annihilation." Cable & Wireless
> 
> ------------------------------------------------------------------------
> This is the analog-help mailing list. To unsubscribe from this
> mailing list, send mail to [EMAIL PROTECTED]
> with "unsubscribe analog-help" in the main BODY OF THE MESSAGE.
> List archived at
> http://www.mail-archive.com/[email protected]/
> ------------------------------------------------------------------------
I-Kong Fu
Software Developer, Nando Media
127 West Hargett St., Suite 406, Raleigh, NC 27601, USA
Tel (work): (919) 836-5684
http://www.nandomedia.com