At 12:22 PM -0500 5/10/00, Gilles Detillieux wrote:
>  > Once Again: Can anyone tell me how the ranking is calculated by the
>  > dig-algorithm? is there a formula? does it matter at which position in the
>  > doc the search-term is found?
>
>In the 3.1.x series, position does matter.  Words at the top of the document
>are ranked higher than words closer to the end.  My understanding is that
>this is no longer the case in the 3.2 betas.  As for the actual formulae,
>I don't really know.  Perhaps someone else can shed more light?

Sorry I didn't get to this sooner. The actual formula is a bit 
complicated. In 3.1.x, the formula for a word factor in a document is 
something like this:

score = Sum(all occurrences)
        [1000-(word location)/1000] * _factor

where _factor is the appropriate factor for the given word (e.g. 
text_factor, keyword_factor, ...)

It's a little more complicated when you consider backlink_factor and 
date_factor, but this is about it.

The 3.2 betas are a bit more complicated and there are still some 
scoring issues to be cleaned up, but the result should be much better.

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.

Reply via email to