Todd Hooge's bits of Tue, 11 Jun 2002 translated to:

>We are using ht://Dig version 3.2.0b4.
>
>Is it possible to configure ht://dig to return documents ONLY if there are
>three or more occurrences of the search term (or phrase) in the document?

I am not aware of any configuration setting that allows this sort of functionality. I seem to recall reading that the databases are structured in a manner that makes it non-trivial to obtain per-document word frequencies. If you were to write a wrapper for htsearch (see for example http://www.htdig.org/FAQ.html#q4.7) and configure a large excerpt length (http://www.htdig.org/attrs.html#excerpt_length), you could filter the output yourself.

>If there was a way we could limit results via the config file as above, this
>should ensure the results are a little more relevant, without us having to
>constantly fine-tune or list. Is there a better way to do this?

Have you checked through the weight-related attributes to see if there is anything there that might help you? In particular, there are a lot of *_factor attributes that allow you to adjust the manner in which documents are scored.

Jim
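
For what it is worth, the filtering step of such a wrapper might look roughly like the sketch below. It is only an illustration under some assumptions: that the wrapper has already run htsearch and parsed its output into (url, excerpt) pairs, which depends entirely on your result templates and is not shown, and that highlighting markup in the excerpt can be stripped with a simple tag-removing regex. The names count_occurrences and filter_results are made up for this example and are not part of ht://Dig.

```python
import re


def count_occurrences(text, term):
    """Count case-insensitive whole-word (or whole-phrase) matches of term in text."""
    # Assumption: any markup in the excerpt (e.g. highlighting tags) is simple
    # enough to remove with a regex before counting.
    plain = re.sub(r"<[^>]+>", "", text)
    words = term.split()
    if not words:
        return 0
    # Words of a phrase may be separated by any run of whitespace.
    body = r"\s+".join(re.escape(w) for w in words)
    pattern = r"(?<!\w)" + body + r"(?!\w)"
    return len(re.findall(pattern, plain, flags=re.IGNORECASE))


def filter_results(results, term, min_count=3):
    """Keep only (url, excerpt) pairs whose excerpt contains term at least min_count times."""
    return [
        (url, excerpt)
        for url, excerpt in results
        if count_occurrences(excerpt, term) >= min_count
    ]
```

Keep in mind that this only counts what appears in the excerpt, so the count is a lower bound on the occurrences in the full document unless the excerpt length is large enough to cover it, and it does exact matching only, so it will not follow whatever fuzzy matching htsearch applied to find the document.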

