According to edwin lin:
> Now I can run boolean "and" or "or" search. The following is the list of 
> "breast and cancer" search result. But why the breast cancer is not listed 
> as the first line but in the 7th line?
...

Well, first of all, the multiboost patch only affects ORed results, not
ANDed.  It shouldn't be needed for AND, because in that case every search
word has to be found in every matching document, so we already have a
level playing field.
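
To make the difference concrete, here is a rough sketch of the two kinds
of merge.  It is not the actual parser.cc code: the Match struct and the
mergeOr()/mergeAnd() names are made up for illustration, and it assumes
each document appears at most once per word list.

```cpp
#include <cassert>
#include <map>
#include <vector>

// Hypothetical stand-in for one word's hit in one document.
// This is NOT ht://Dig's real DocMatch class.
struct Match {
    int docId;
    double score;
};

// OR merge: a document may come from either list.  Only documents found
// in both lists (the "duplicate document" case) get the boost.
std::map<int, double> mergeOr(const std::vector<Match>& a,
                              const std::vector<Match>& b,
                              double boost) {
    assert(boost > 0.0);  // a zero or negative boost would wreck the ranking
    std::map<int, double> result;
    for (const Match& m : a)
        result[m.docId] = m.score;
    for (const Match& m : b) {
        auto it = result.find(m.docId);
        if (it == result.end())
            result[m.docId] = m.score;                    // one word only
        else
            it->second = (it->second + m.score) * boost;  // both words
    }
    return result;
}

// AND merge: only documents found in both lists survive, so every result
// already matched all the words and there is nothing left to boost.
std::map<int, double> mergeAnd(const std::vector<Match>& a,
                               const std::vector<Match>& b) {
    std::map<int, double> inA;
    for (const Match& m : a)
        inA[m.docId] = m.score;
    std::map<int, double> result;
    for (const Match& m : b) {
        auto it = inA.find(m.docId);
        if (it != inA.end())
            result[m.docId] = it->second + m.score;
    }
    return result;
}
```

With OR, the boost pushes the documents containing both words above the
ones containing only one.  With AND, the one-word documents are never in
the list to begin with.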

> I applied the patch and edited the value of 1000 to 12 according to the 
> value in config.Value expression. see the patched parser.cc.
> 
>             // Duplicate document.  We just need to add the scores together
>             //
>             dm2->score += dm->score;
>             // dnaber: boost matches that contain more than one word (dnaber, 2000-10-22):
>             dm2->score = dm2->score * 12;
>             if (dm->anchor < dm2->anchor)
>                 dm2->anchor = dm->anchor;
> 
> What else should I do? Thanks,

I don't understand where the 12 comes from.  The second argument to
config.Value() is just a default, so different calls to config.Value()
in the code will have different defaults.  You pick the one that's
appropriate for your purpose.  You can set it to whatever you want,
but the point of my suggestion to use config.Value() in place of a
hardcoded 1000 (or 12, or whatever) was to let you try different values
in your config file without having to recompile and reinstall htsearch.
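
Roughly, the idea looks like the sketch below.  The Config class is only
a small stand-in so the example compiles on its own (the real ht://Dig
configuration class is different), and both the attribute name
"multimatch_factor" and the boostScore() helper are assumptions made for
illustration.

```cpp
#include <cassert>
#include <cstdlib>
#include <map>
#include <string>

// Hypothetical stand-in for the config object; not ht://Dig's own class.
class Config {
public:
    void Add(const std::string& name, const std::string& value) {
        values_[name] = value;
    }

    // Return the attribute as an integer, or defaultValue if it is unset.
    int Value(const std::string& name, int defaultValue = 0) const {
        auto it = values_.find(name);
        if (it == values_.end())
            return defaultValue;
        return std::atoi(it->second.c_str());
    }

private:
    std::map<std::string, std::string> values_;
};

// Boost a multi-word match.  The factor comes from the config file when
// the attribute is set there, and falls back to 1000 otherwise.
// "multimatch_factor" is an assumed attribute name.
double boostScore(const Config& config, double score) {
    int factor = config.Value("multimatch_factor", 1000);
    assert(factor > 0);  // guard against an unparsable or zero setting
    return score * factor;
}
```

Here the 1000 only matters when the attribute is missing from the config
file.  Once you set it there, changing the boost is a matter of editing
one line and rerunning the search.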

In any case, this still only affects ORing, not ANDing, so a search for
"breast and cancer" will not be affected by this.  All your search results
will contain both words in the document, somewhere, if you use "and".
They'll be ranked according to the scores of the two words in these
documents, but there's no guarantee that documents with both words in
the title will score higher.  There are other factors involved.  Indeed,
by default I think link description text will affect the score more than
title text will.  You can try tweaking your various *_factor attributes.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
