Dear Geoff/Jim,

Interesting... I will attempt the *_factor attributes first and see if they
make enough of a difference... if not, we will have a look at the scoring
elements in the code and/or piping the extended results through a wrapper...
I'm sure we'll find a recipe out of this discussion that works.

Thanks to both of you for your time.

Best Wishes,
Todd

Todd Hooge
Website Developer
The Communication Initiative
http://www.comminit.com/

-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED]]On Behalf Of Geoff
Hutchison
Sent: Tuesday, July 09, 2002 7:03 AM
To: Jim Cole
Cc: Todd Hooge; ht://Dig General Users List
Subject: Re: [htdig] minimum_word_count?



On Sunday, July 7, 2002, at 08:20  PM, Jim Cole wrote:

>> Is it possible to configure ht://dig to return documents ONLY if there
>> are
>> three or more occurrences of the search term (or phrase) in the
>> document?
>
> I am not aware of any configuration setting that allows this sort
> of functionality. I seem to recall reading that the databases are
> structured in a manner that makes it non-trivial to obtain
> per-document word frequencies.

This is not entirely true, but close enough. Basically, you'd still have
to retrieve all the documents anyway, and then you'd have to filter the
search results carefully. You certainly could code in such a restriction
(i.e. while it's doing the scoring, it counts the number of word
occurrences), but the key word there at the moment is *code*. Since no
one else has really asked for this, it's not high on any of the active
developers' lists.

>> If there was a way we could limit results via the config file as
>> above, this
>> should ensure the results are a little more relevant, without us
>> having to
>> constantly fine-tune or list. Is there a better way to do this?
>
> Have you checked through the weight related attributes to see if
> there is anything there that might help you. In particular, there
> are a lot of *_factor attributes that allow you to adjust the

Todd, you also mention using 3.2.0b4 snapshots. All of the scoring for
3.2 is done in htsearch, so you can quickly and easily change the
factors and adjust the rankings. No reindexing is needed. If you find
weightings that you like better, we'd be interested to know them.

Regards,

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Stuff, things, and much much more.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to
<[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Two, two, TWO treats in one.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to