The LOGFORMAT patterns currently match minimally, i.e. variables only
consume characters different from the delimiter character immediately
following the variable.
For example with
LOGFORMAT (%S %j %u [%d/%M/%Y:%h:%n:%j] "%j%w%r%wHTTP%j" %c %b "%f"
"%B")
the entry
1.2.3.4 - - [28/Aug/2001:05:23:08 +0000] "GET / HTTP/1.0" 200 18835
"http://localhost/show.cgi?title="test"" "Mozilla/4.0"
is marked as a corrupted log entry.
As a simple enhancement to log pattern interpretation I suggest to match
if and only if all static characters following a variable, including
whitespace, are matched, so doublequote-blank would terminate the
referrer string whereas doublequote-nonblank would not.
In addition having an option to rewrite log file lines before they are
parsed would be nice so non-standard log formats could easily be adapted
before they get discarded as invalids.
--
Klaus Johannes Rusch
[EMAIL PROTECTED]
http://www.atmedia.net/KlausRusch/
+------------------------------------------------------------------------
| This is the analog-help mailing list. To unsubscribe from this
| mailing list, go to
| http://lists.isite.net/listgate/analog-help/unsubscribe.html
|
| List archives are available at
| http://www.mail-archive.com/[email protected]/
| http://lists.isite.net/listgate/analog-help/archives/
| http://www.tallylist.com/archives/index.cfm/mlist.7
+------------------------------------------------------------------------