According to Tony Crockford:
> The only thing I can see that might be at the root of the problem is
> that, since the *keyword* list is built by the scripting program from
> field data, punctuation and *bad words* are getting in. ( ' , - the, in,
> etc.) Could this be the answer? If so why? since when searching
> without restrictions adding punctuation to my keyed in words does not
> fail.
...
> <input type=hidden name=keywords value="Sandwell Cradley Heath Everyday
> Life Houses and Homes"> at the search form works fine.
>
> <input type=hidden name=keywords value="Sandwell Cradley Heath Everyday
> Life Leisure, Sport and Pastimes"> at the search form doesn't
...
> Am I right in thinking that keywords added in this way follow different
> rules to words entered directly into a search box, if so how do I
> configure htdig to ignore punctuation and bad words and allow the search
> to proceed normally?
Yes, the parsing of the keywords input parameter is much more
simple-minded than the parsing of the words input parameter. It only
allows white space (space, tab, newline) as separators, as well as Ctrl-A
which is the separator used for multiple input parameter definitions,
e.g. if you use checkboxes for building keywords input parameters.
There's no configuration option to change this. You need either to
clean up the keywords list you give to htsearch, or you need to modify
htsearch to do better parsing of this parameter. Right now, this is at
line 179 in 3.1.5's htsearch/htsearch.cc:
requiredWords.Create(config["keywords"], " \t\r\n\001");
You'd either need to add all punctuation characters you're likely
to encounter to the list at the right, or you'd need to build up the
StringList some other way.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html