Sorry David, didn't realise when I hit reply that I didn;t include the mailing list 
address.

My question/problem/cry for help is below ;)

-------------------------------------------------------------------------------
David Grubb - Internet / Intranet Developer
[EMAIL PROTECTED]       +61 2 9895-7913
Department of Land & Water Conservation
Sydney,  Australia
-------------------------------------------------------------------------------


Hi all

Just tried a few more things on this, but still having problems. I've set the 
description_factor to 0 as suggested, set search_algorithm as exact:1, all other 
indexing and search options have been left at the default vaulues. I've then rebuilt 
the index.

Documents that do not contain the search word are still being returned with scores 
higher than documents with the search word. The wierd thing is these documents don't 
contain the search word at all (ie the word is not present in the HTML source) and 
shouldn't be included in the results.

Any more suggestions?

Cheers
Dave

-------------------------------------------------------------------------------
David Grubb - Internet / Intranet Developer
[EMAIL PROTECTED]       +61 2 9895-7913
Department of Land & Water Conservation
Sydney,  Australia
-------------------------------------------------------------------------------

>>> David Adams <[EMAIL PROTECTED]> 09/25 8:34 pm >>>
> 
> Hi all 
> > Having some trouble with the results of searches, and
hoping someone can offer some advice.  
> > An example of the problem:
searching for the word "email" returns a number of documents, one of
those contains the word "mail" (not "email") and scores higher than a
number of documents that contain the word "email" 
> > In the conf file,
search_algorithm is set to exact:1 synonyms:0.5 endings:0.1 
> > Any ideas? 
> > Thanks in advance
> 
> -------------------------------------------------------------------------------
> David Grubb - Internet / Intranet Developer
> [EMAIL PROTECTED]       +61 2 9895-7913
> Department of Land & Water Conservation
> Sydney,  Australia
> -------------------------------------------------------------------------------

Take a look at the <HEAD> section of that document.  Are there <META>
statements which contain "email" as a keyword, or in the description? 
If there are, then all is explained. 

Another possibility is that you have a number of links to the document
where the text contains the word "email".  You could try adding:

description_factor: 0

to your configuration file and re-making the index.

-- 
 
David J Adams
<[EMAIL PROTECTED]>
Computing Services
University of Southampton

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] 
You will receive a message to confirm this.
List archives:  <http://www.htdig.org/mail/menu.html>
FAQ:            <http://www.htdig.org/FAQ.html>





------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  <http://www.htdig.org/mail/menu.html>
FAQ:            <http://www.htdig.org/FAQ.html>

Reply via email to