On Wed, 25 Apr 2001, Iain Hunneybell wrote:

> I seem to be getting some very odd page count figures and wonder if someone
> can help me. I've also discovered an 'undocumented feature' in analog 4.16!
> 
> 'Undocumented features' first!
> I've discovered that analog will be confused by a period (.) in the URL. I
> have a combination of FILEEXCLUDE but then find an exclude filetype
> appearing in the Request Report
> 
> >From the .cfg file
> PAGEEXCLUDE *.gif,*.css,*.jpg,*.js,*.bat,*.ico,*.swf
> FILEEXCLUDE *.gif,*.css,*.jpg,*.js,*.bat,*.ico,*.swf
> 
> And then in the Requests Report
> 5515: 20.92%:       : /com.coname/images/conamehome_store.swf
> 
> It would look like the (albeit unconventional) directory name 'com.coname'
> is causing the parsing of the URL to see this as a *.coname file. not as a
> *.swf and so the file count is included even though it's explicitly
> excluded.
> 

No, I think it don't think that that's the problem. Do the *.swf files have
arguments after them? If so, you also need to exclude *.swf?* -- see
docs/args.html.

> Now my page count problem
> This comes from the same experiments with analog. My problem is that the
> page count being given in the general summary is less even than some of the
> individual page counts. So from the same report:
> 
> Successful requests for pages: 9,289
> 
> Now the Requests Report on the same report shows:
> reqs:  %reqs: %pages: file
> ----: ------: ------: ----
> 9535: 36.16%:       : /conamehomepage
> 
> Note that <conamehomepage> shows a count of 9,535 whereas the 'Successful
> requests for pages' is  actually less at 9,289.
> 
> I've read through the documentation and simply can't figure out how
> 'Successful requests for pages' can actually be less than the total requests
> for pages. Can anyone give me any help?
> 
> The .cfg includes the folling PAGEINCLUDE/EXCLUDE statements:
> # PAGEINCLUDE
> *.asa,*.asp,*.cdx,*.cer,*.cfm,*.dbm,*.htm,*.html,*.htmls,*.htp,*.htpl,*.htr,
> *.htw,*.ida,*.idc,*.idq,*.mdl,*.sht,*.shtm, *.shtml,*.stm,*.txt,/
> # FILEINCLUDE
> *.asa,*.asp,*.cdx,*.cer,*.cfm,*.dbm,*.htm,*.html,*.htmls,*.htp,*.htpl,*.htr,
> *.htw,*.ida,*.idc,*.idq,*.mdl,*.sht,*.shtm, *.shtml,*.stm,*.txt,/
> PAGEEXCLUDE *.gif,*.css,*.jpg,*.js,*.bat,*.ico,*.swf
> FILEEXCLUDE *.gif,*.css,*.jpg,*.js,*.bat,*.ico,*.swf
> 
> Note I have done a good deal of experimentation around PAGE/FILEINCLUDE
> including adding */ and simply / (as illustrated) as a PAGE and FILEINCLUDE
> option.

When analog starts up, it includes the (equivalent of the) command
  PAGEINCLUDE *.html,*.html,*/
So you can imagine this command at the top of your configuration file.

Now /conamehomepage doesn't match any of the inclusions or exclusions, and a
PAGEINCLUDE is first, so /conamehomepage isn't a page.

If you really want to include everything except named stuff as a page, then
put
  PAGEINCLUDE *
before the PAGEEXCLUDE command.

This is obvious if you run
  analog -settings
It's not obvious if you only read the documentation. :)

-- 
Stephen Turner               http://www.statslab.cam.ac.uk/~sret1/
  Statistical Laboratory, Wilberforce Road, Cambridge, CB3 0WB, England
  "Your account can only be used for a single internet session at any one
   time and for no more than 24 hours in any one day." (NTL terms of use)

+------------------------------------------------------------------------
|  This is the analog-help mailing list. To unsubscribe from this
|  mailing list, go to
|    http://lists.isite.net/listgate/analog-help/unsubscribe.html
|
|  List archives are available at
|    http://www.mail-archive.com/[email protected]/
|    http://lists.isite.net/listgate/analog-help/archives/
|    http://www.tallylist.com/archives/index.cfm/mlist.7
+------------------------------------------------------------------------

Reply via email to