Hi,
thanks for your answer. It helped me a lot, but it also led me to the next
problem...
I reported the error (missing quotes around strings) to BEA, who sent me a
patch. Now my logs look like this:
#Version: 1.0
#Fields: date time c-ip cs-username cs-method cs-uri-stem sc-status cs-bytes cs-version cs(User-Agent) cs(Referer)
#Software: WebLogic
2001-05-18 11:08:37 178.194.48.186 - GET /mediadb/menu.html 304 - - "Mozilla/4.73 [de] (WinNT; U)" "http://unix13:15751/links/links1.html"
2001-05-18 11:08:37 178.194.48.186 - GET /mediadb/CSS/CatalogueCSS.css 304 - - "Mozilla/4.73 [de] (WinNT; U)" -
...
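To check the field count independently of Analog, here is a minimal Python sketch of the splitting rule as I understand it: any whitespace separates fields unless it sits inside a double-quoted string. The regular expression is a simplification (it does not handle escaped quotes inside strings), the name split_fields is made up for illustration, and the exact placement of tabs in the sample strings is an assumption.

```python
import re

# Simplified tokenizer: a field is either a double-quoted string or a run of
# non-whitespace characters. Escaped quotes inside strings are not handled.
FIELD_RE = re.compile(r'"[^"]*"|\S+')

# The #Fields directive from the log, split into the individual field names.
FIELDS = ("date time c-ip cs-username cs-method cs-uri-stem sc-status "
          "cs-bytes cs-version cs(User-Agent) cs(Referer)").split()


def split_fields(line):
    """Split one log entry into fields, keeping quoted strings whole."""
    return FIELD_RE.findall(line)


# One entry written by the patched server (strings quoted, empty referrer).
patched = ('2001-05-18 11:08:37\t178.194.48.186\t-\tGET\t'
           '/mediadb/CSS/CatalogueCSS.css\t304\t-\t-\t'
           '"Mozilla/4.73 [de] (WinNT; U)"\t-')

# One entry written before the patch (strings not quoted).
unpatched = ('2001-05-16 14:48:36\t178.194.48.186\t-\tGET\t'
             '/mediadb/menu.html\t304\t-\t-\t'
             'Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 4.0)\t'
             'http://unix63:15447/mediadb/index.html')

for name, line in (("patched", patched), ("unpatched", unpatched)):
    tokens = split_fields(line)
    print(name, len(tokens), "fields, expected", len(FIELDS))
```

Under these assumptions the patched entry yields 11 fields, matching the #Fields directive, while the unpatched entry yields 17 because the unquoted browser string breaks into seven pieces.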
I did not specify a LOGFORMAT in analog.cfg. Analog detects the W3C extended
format, but still reports errors. With DEBUG mode turned on, this is the
output:
F: Detect that it's in W3 extended format
./analog: Warning M: Logfile /home/beh09/download/analog-5.0/test.log contains
  lines with no bytes: byte counts may be low
(For help on all errors and warnings, see docs/errors.html)
C: 2001-05-18 11:08:37 178.194.48.186 - GET /mediadb/CSS/CatalogueCSS.css 304 - - "Mozilla/4.73 [de] (WinNT; U)" -
C:
...
S: Corrupt lines: 30
./analog: Warning L: Large number of corrupt lines in logfile
/home/beh09/download/analog-5.0/test.log: try different LOGFORMAT
Current logfile format:
#Fields:<W3 extended format string>\n
#%j\n
%Y-%m-%d%w%h:%n:%j%w%S%w%j%w%j%w%r%w%c%w%j%w%j%w"%B"%w"%f"\n
Analog now parses the cs(Referer) field correctly when it contains a value.
When the field is empty and logged as a bare "-", however, the line does not
match the format and Analog marks it as corrupt. After I removed the lines
with empty cs(Referer) fields from the log, Analog processed the file
without errors.
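The removal step can be sketched in Python as follows. The regular expression is only a rough stand-in for the tail of the detected format string ("%B"%w"%f"), not Analog's actual matcher, and the function name partition is hypothetical. It separates entries that end in two quoted strings from those that do not, which reproduces the split between accepted and corrupt lines described above.

```python
import re

# Rough stand-in for the tail of the detected format, "%B"%w"%f":
# a quoted browser, whitespace, a quoted referrer, end of line.
# This is NOT Analog's matcher, only an approximation of what it requires.
QUOTED_TAIL = re.compile(r'"[^"]*"\s+"[^"]*"\s*$')


def partition(lines):
    """Return (accepted, rejected) lists by the quoted-tail requirement.

    Directive lines starting with '#' are always kept.
    """
    accepted, rejected = [], []
    for line in lines:
        if line.startswith("#") or QUOTED_TAIL.search(line):
            accepted.append(line)
        else:
            rejected.append(line)
    return accepted, rejected


sample = [
    "#Version: 1.0",
    '2001-05-18 11:08:37 178.194.48.186 - GET /mediadb/menu.html 304 - - '
    '"Mozilla/4.73 [de] (WinNT; U)" "http://unix13:15751/links/links1.html"',
    '2001-05-18 11:08:37 178.194.48.186 - GET /mediadb/CSS/CatalogueCSS.css '
    '304 - - "Mozilla/4.73 [de] (WinNT; U)" -',
]

accepted, rejected = partition(sample)
print(len(accepted), "kept;", len(rejected), "would be dropped")
```

Dropping the rejected entries loses real requests, so this only confirms where the problem lies; it is not a fix.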
Can you still find any inconsistencies between my log and the W3C
specification? If not, what can I do to make Analog process the lines with
empty cs(Referer) fields?
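One conceivable preprocessing step, offered only as an untested sketch: rewrite a bare - in the final field to "-" so that every entry ends in a quoted string, which is what the detected format string appears to require. Whether Analog then treats "-" as an empty referrer or counts it as a literal referrer is an assumption that would need checking, and the function name quote_empty_referrer is hypothetical.

```python
import re

# A bare "-" as the final field, preceded by one whitespace character.
BARE_DASH_AT_END = re.compile(r'(\s)-\s*$')


def quote_empty_referrer(line):
    """Replace a trailing bare - with "-"; leave all other lines untouched."""
    line = line.rstrip("\n")
    if line.startswith("#"):
        return line  # directive lines are passed through unchanged
    return BARE_DASH_AT_END.sub(r'\1"-"', line)


empty_ref = ('2001-05-18 11:08:37 178.194.48.186 - GET '
             '/mediadb/CSS/CatalogueCSS.css 304 - - '
             '"Mozilla/4.73 [de] (WinNT; U)" -')
with_ref = ('2001-05-18 11:08:37 178.194.48.186 - GET /mediadb/menu.html '
            '304 - - "Mozilla/4.73 [de] (WinNT; U)" '
            '"http://unix13:15751/links/links1.html"')

print(quote_empty_referrer(empty_ref))
print(quote_empty_referrer(with_ref))
```

Only the last field is touched, so the bare - values in cs-username, cs-bytes and cs-version stay as they are.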
Thank you very much for your help!
Jan
> -----Original Message-----
> From: Stephen Turner [mailto:[EMAIL PROTECTED]]
> Sent: Thursday, 17 May 2001 11:41
> To: [EMAIL PROTECTED]
> Subject: Re: [analog-help] Problems processing W3C Extended logfiles
> generated by BEA Weblogic Application Server
>
> On Wed, 16 May 2001 [EMAIL PROTECTED] wrote:
>
> > Hi,
> >
> > I currently face problems running Analog. It won't display my extended
> > logfiles. No, it is not a problem with missing dates ...
> > I read the Mailing list where I found comments like
> >
> > Analog will auto-detect the format of W3C Extended logfiles. You
> > don't need to provide a format.
> >
> > My logfile (generated by BEA Weblogic Application Server) looks like this:
> >
> > #Version: 1.0
> > #Fields: date time c-ip cs-username cs-method cs-uri-stem sc-status cs-bytes cs-version cs(User-Agent) cs(Referer)
> > #Software: WebLogic
> > 2001-05-16 14:48:36 178.194.48.186 - GET /mediadb/menu.html 304 - - Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 4.0) http://unix63:15447/mediadb/index.html
> > 2001-05-16 14:48:36 178.194.48.186 - GET /mediadb/index.html 304 - - Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 4.0) http://unix13:15751/links/links1.html
> > ...
> >
> > (Fields are tab-separated).
> >
> > I took a visit to http://www.w3.org/TR/WD-logfile.html (W3C Working Draft
> > WD-logfile-960323), where they define entries in the field-directive of the
> > kind prefix(header) in the chapter "Fields". In my eyes, my field definition
> > meets the specification.
> >
>
> No, in fact it doesn't. The spec mandates that the browser and referrer
> should be in quotes. Also that any whitespace (not just tabs) is a field
> separator, unless it's in an item in quotes. (Hence the browser above is
> actually seven fields!)
>
> You can write a LOGFORMAT to deal with this format. Get back to this list if
> you need help with that.
>
> Please also file a bug report with your server vendor.
>
> --
> Stephen Turner http://www.statslab.cam.ac.uk/~sret1/
> Statistical Laboratory, Wilberforce Road, Cambridge, CB3 0WB, England
> "Your account can only be used for a single internet session at any one
> time and for no more than 24 hours in any one day." (NTL terms of use)
>
> +------------------------------------------------------------------------
> | This is the analog-help mailing list. To unsubscribe from this
> | mailing list, go to
> | http://lists.isite.net/listgate/analog-help/unsubscribe.html
> |
> | List archives are available at
> | http://www.mail-archive.com/[email protected]/
> | http://lists.isite.net/listgate/analog-help/archives/
> | http://www.tallylist.com/archives/index.cfm/mlist.7
> +------------------------------------------------------------------------
>