The LOGFORMAT patterns currently match minimally: a variable consumes
characters only up to the first occurrence of the delimiter character
that immediately follows it in the format.

For example, with

LOGFORMAT (%S %j %u [%d/%M/%Y:%h:%n:%j] "%j%w%r%wHTTP%j" %c %b "%f"
"%B")

the entry

1.2.3.4 - - [28/Aug/2001:05:23:08 +0000] "GET / HTTP/1.0" 200 18835
"http://localhost/show.cgi?title="test"" "Mozilla/4.0"

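is marked as a corrupted log entry: %f stops at the doublequote in front
of test, and the rest of the line then no longer fits the format.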
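
To make the failure concrete, here is a rough Python sketch of minimal
matching. It is not analog's actual parser: the name match_minimal, the
shortened format and the treatment of every variable as plain text are
assumptions made purely for illustration.

```python
# Simplified model of minimal matching (NOT analog's real code).
# Assumption: every %x variable is plain text that ends at the first
# occurrence of the single literal character following it in the format.
# Adjacent variables such as %j%w are not handled in this sketch.

def match_minimal(fmt, line):
    """Return a dict of captured fields, or None if the line is 'corrupt'."""
    fields = {}
    i = 0    # position in the format
    pos = 0  # position in the log line
    while i < len(fmt):
        if fmt[i] == "%" and i + 1 < len(fmt):
            name = fmt[i + 1]
            i += 2
            if i == len(fmt):
                # A trailing variable takes the rest of the line.
                fields[name] = line[pos:]
                pos = len(line)
            else:
                # Stop at the first occurrence of the next literal character.
                end = line.find(fmt[i], pos)
                if end == -1:
                    return None
                fields[name] = line[pos:end]
                pos = end
        else:
            # Literal characters must match exactly.
            if pos >= len(line) or line[pos] != fmt[i]:
                return None
            pos += 1
            i += 1
    return fields if pos == len(line) else None


# Shortened, hypothetical format covering only the tail of the entry.
FMT = '"%r" %c %b "%f" "%B"'
PLAIN = '"GET / HTTP/1.0" 200 18835 "http://localhost/" "Mozilla/4.0"'
QUOTED = ('"GET / HTTP/1.0" 200 18835 '
          '"http://localhost/show.cgi?title="test"" "Mozilla/4.0"')

print(match_minimal(FMT, PLAIN))
print(match_minimal(FMT, QUOTED))
```

In this model the second line is rejected because %f ends at the
doublequote in front of test, after which the format expects a blank but
the line continues with a letter.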

As a simple enhancement to log pattern interpretation, I suggest ending a
variable only where all static characters following it in the format,
including whitespace, are matched. A doublequote followed by a blank
would then terminate the referrer string, whereas a doublequote followed
by a non-blank would not.
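
A rough Python sketch of what I have in mind follows. Again this is only
an illustration under assumptions: split_format and match_full_delimiter
are made-up names, the format is shortened, and every variable is
treated as plain text rather than with analog's real per-variable rules.

```python
import re

# Sketch of the suggested behaviour (NOT analog's real code): a variable
# ends only where the whole run of static characters that follows it in
# the format is found in the line.

def split_format(fmt):
    """Split a format into ('var', letter) and ('lit', text) tokens."""
    tokens = []
    for m in re.finditer(r"%(\w)|([^%]+)", fmt):
        if m.group(1):
            tokens.append(("var", m.group(1)))
        else:
            tokens.append(("lit", m.group(2)))
    return tokens


def match_full_delimiter(fmt, line):
    """Return a dict of captured fields, or None if the line does not fit."""
    tokens = split_format(fmt)
    fields = {}
    pos = 0
    for idx, (kind, value) in enumerate(tokens):
        if kind == "lit":
            # Static text must appear exactly at the current position.
            if not line.startswith(value, pos):
                return None
            pos += len(value)
            continue
        nxt = tokens[idx + 1] if idx + 1 < len(tokens) else None
        if nxt is None:
            end = len(line)  # trailing variable takes the rest
        elif nxt[0] == "lit":
            # Search for the complete static run, not just its first char.
            end = line.find(nxt[1], pos)
            if end == -1:
                return None
        else:
            raise ValueError("adjacent variables are not handled here")
        fields[value] = line[pos:end]
        pos = end
    return fields if pos == len(line) else None


# Shortened, hypothetical format covering only the tail of the entry.
FMT = '"%r" %c %b "%f" "%B"'
QUOTED = ('"GET / HTTP/1.0" 200 18835 '
          '"http://localhost/show.cgi?title="test"" "Mozilla/4.0"')

print(match_full_delimiter(FMT, QUOTED))
```

Here the static run after %f is doublequote, blank, doublequote, so the
doublequotes around test stay inside the referrer and the line is
accepted.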

In addition, an option to rewrite log file lines before they are parsed
would be nice, so that non-standard log formats could easily be adapted
rather than discarded as invalid.
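
Such a rewrite step could be as small as a list of search-and-replace
rules applied to each line before parsing. The sketch below is
hypothetical: REWRITE_RULES and rewrite_line are invented names, the two
rules are arbitrary examples, and nothing like this exists in analog as
far as I know.

```python
import re

# Hypothetical pre-parse rewrite rules: (compiled pattern, replacement).
# The rules are examples only; a real option would let the user supply them.
REWRITE_RULES = [
    (re.compile(r"\t+"), " "),   # turn tab-separated fields into blanks
    (re.compile(r"\s+$"), ""),   # drop trailing whitespace such as CR
]


def rewrite_line(line, rules=REWRITE_RULES):
    """Apply each rule in order and return the rewritten line."""
    for pattern, replacement in rules:
        line = pattern.sub(replacement, line)
    return line


print(rewrite_line("1.2.3.4\t-\t-\t[28/Aug/2001:05:23:08 +0000]\r"))
```

The rewritten line would then be handed to the normal LOGFORMAT matching.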


-- 
Klaus Johannes Rusch
[EMAIL PROTECTED]
http://www.atmedia.net/KlausRusch/

+------------------------------------------------------------------------
|  This is the analog-help mailing list. To unsubscribe from this
|  mailing list, go to
|    http://lists.isite.net/listgate/analog-help/unsubscribe.html
|
|  List archives are available at
|    http://www.mail-archive.com/[email protected]/
|    http://lists.isite.net/listgate/analog-help/archives/
|    http://www.tallylist.com/archives/index.cfm/mlist.7
+------------------------------------------------------------------------
