On Wed, 25 Apr 2001, Iain Hunneybell wrote:
> I seem to be getting some very odd page count figures and wonder if someone
> can help me. I've also discovered an 'undocumented feature' in analog 4.16!
>
> 'Undocumented features' first!
> I've discovered that analog will be confused by a period (.) in the URL. I
> have a combination of FILEEXCLUDE but then find an excluded filetype
> appearing in the Request Report
>
> From the .cfg file
> PAGEEXCLUDE *.gif,*.css,*.jpg,*.js,*.bat,*.ico,*.swf
> FILEEXCLUDE *.gif,*.css,*.jpg,*.js,*.bat,*.ico,*.swf
>
> And then in the Requests Report
> 5515: 20.92%: : /com.coname/images/conamehome_store.swf
>
> It would look like the (albeit unconventional) directory name 'com.coname'
> is causing the parsing of the URL to see this as a *.coname file, not as a
> *.swf and so the file count is included even though it's explicitly
> excluded.
>
No, I don't think that that's the problem. Do the *.swf files have
arguments after them? If so, you also need to exclude *.swf?* -- see
docs/args.html.
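
As a rough sketch, assuming the .swf requests really do arrive with
arguments, the extra exclusion could look something like this (only the
*.swf entries are shown; the rest of your list stays as it is):

```
# Exclude .swf files both without and with arguments.
# Assumption: the logged requests look like /path/file.swf?something
FILEEXCLUDE *.swf,*.swf?*
```
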
> Now my page count problem
> This comes from the same experiments with analog. My problem is that the
> page count being given in the general summary is less even than some of the
> individual page counts. So from the same report:
>
> Successful requests for pages: 9,289
>
> Now the Requests Report on the same report shows:
> reqs: %reqs: %pages: file
> ----: ------: ------: ----
> 9535: 36.16%: : /conamehomepage
>
> Note that <conamehomepage> shows a count of 9,535 whereas the 'Successful
> requests for pages' is actually less at 9,289.
>
> I've read through the documentation and simply can't figure out how
> 'Successful requests for pages' can actually be less than the total requests
> for pages. Can anyone give me any help?
>
> The .cfg includes the following PAGEINCLUDE/EXCLUDE statements:
> # PAGEINCLUDE
> *.asa,*.asp,*.cdx,*.cer,*.cfm,*.dbm,*.htm,*.html,*.htmls,*.htp,*.htpl,*.htr,
> *.htw,*.ida,*.idc,*.idq,*.mdl,*.sht,*.shtm, *.shtml,*.stm,*.txt,/
> # FILEINCLUDE
> *.asa,*.asp,*.cdx,*.cer,*.cfm,*.dbm,*.htm,*.html,*.htmls,*.htp,*.htpl,*.htr,
> *.htw,*.ida,*.idc,*.idq,*.mdl,*.sht,*.shtm, *.shtml,*.stm,*.txt,/
> PAGEEXCLUDE *.gif,*.css,*.jpg,*.js,*.bat,*.ico,*.swf
> FILEEXCLUDE *.gif,*.css,*.jpg,*.js,*.bat,*.ico,*.swf
>
> Note I have done a good deal of experimentation around PAGE/FILEINCLUDE
> including adding */ and simply / (as illustrated) as a PAGE and FILEINCLUDE
> option.
When analog starts up, it includes the (equivalent of the) command
PAGEINCLUDE *.html,*.htm,*/
So you can imagine this command at the top of your configuration file.
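Now /conamehomepage doesn't match any of the inclusions or exclusions.
Because a PAGEINCLUDE comes first, anything that matches nothing starts out
excluded, so /conamehomepage isn't a page. That is why it appears in the
Request Report but isn't counted in 'Successful requests for pages'.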
If you really want to include everything except named stuff as a page, then
put
PAGEINCLUDE *
before the PAGEEXCLUDE command.
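
For illustration, a sketch of that ordering, reusing the exclusion list
from your .cfg (adjust the list to suit your site):

```
# Count everything as a page to begin with...
PAGEINCLUDE *
# ...then take the named file types back out again.
PAGEEXCLUDE *.gif,*.css,*.jpg,*.js,*.bat,*.ico,*.swf
```

With the commands in this order, /conamehomepage matches PAGEINCLUDE * and
none of the exclusions, so it should be counted as a page.
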
This is obvious if you run
analog -settings
It's not obvious if you only read the documentation. :)
--
Stephen Turner http://www.statslab.cam.ac.uk/~sret1/
Statistical Laboratory, Wilberforce Road, Cambridge, CB3 0WB, England
"Your account can only be used for a single internet session at any one
time and for no more than 24 hours in any one day." (NTL terms of use)
+------------------------------------------------------------------------
| This is the analog-help mailing list. To unsubscribe from this
| mailing list, go to
| http://lists.isite.net/listgate/analog-help/unsubscribe.html
|
| List archives are available at
| http://lists.isite.net/listgate/analog-help/archives/
| http://www.tallylist.com/archives/index.cfm/mlist.7
+------------------------------------------------------------------------