Is there a setting to make Analog 5.1 (running on Solaris 8) understand
';' as a separator for search arguments rather than the usual '&'?

I am trying to generate internal search reports from an Apache log that
contains hits to the htdig search engine that we use on our site, but I
was getting entries like the following in the Internal Search Query
Report:

  23: application page=2
  18: courses page=2

And these entries in the Internal Search Word Report:

  593: page=2
  132: page=3 

After a bit of search I discovered that htdig, as of version 3.1.5,
started using ';' as the argument separator, as per their explanation at
http://www.htdig.org/FAQ.html#q5.21  The following entries from my
server log files confirm this behavior:

130.91.213.242 - - [03/Jan/2002:06:59:23 -0500] "GET
/cgi-bin/htsearch?words=APPLICATION%20FORM;page=2 HTTP/1.1" 200 10912
130.91.213.242 - - [03/Jan/2002:06:59:38 -0500] "GET
/cgi-bin/htsearch?words=APPLICATION%20FORM;page=3 HTTP/1.1" 200 3850

Based on the reports generated, Analog seems to be treating the
semi-colon as a word seperator rather than as a parameter seperator.  Is
there an easy way to change this behavior?  If not, is there a good way
to have Analog ignore the URI from ";page=" to its conclusion, since the
page=n is always generated as the last argument by htdig?

Any thoughts or recommendations would be greatly appreciated!

--
Best wishes,
Craig A. Haynal
Penn State Graduate School
+------------------------------------------------------------------------
|  This is the analog-help mailing list. To unsubscribe from this
|  mailing list, go to
|    http://lists.isite.net/listgate/analog-help/unsubscribe.html
|
|  List archives are available at
|    http://www.mail-archive.com/[email protected]/
|    http://lists.isite.net/listgate/analog-help/archives/
|    http://www.tallylist.com/archives/index.cfm/mlist.7
+------------------------------------------------------------------------

Reply via email to