According to Evaldas Imbrasas:
> I use htdig 3.1.6 on linux. Search database has only ~5000 documents. I
> just tried to search for 'something*' (keyword + star), and two best
> matches displayed 109 'empty star' images (yes, I counted them) in
> search results page! The third match had 5 'full star' images, the
> fourth one - 4 'full stars' and so on. It does not happen everytime I
> search for some keyword + *, but it did happened a few times with
> different keywords + * (fuzzy search). It never happened for regular
> searches (without *).
>
> Configuration is:
> max_stars: 5
> star_image: star.gif
> star_blank: star_blank.gif # (empty stars I was writing above)
> search_algorithm: exact:1 prefix:0.9
> prefix_match_character: *
> backlink_factor: 500
> description_factor: 200
> title_factor: 100
> text_factor: 1
> keywords_factor: 0
> meta_description_factor: 0
> multimatch_factor: 10
>
> Any idea what's going on?

Apparently there's something very weird happening with the scoring of
prefix fuzzy searches. You don't mention what your locale is set to, but
if you use a locale in which the decimal point is a comma, you may have to
use a comma rather than a period for the prefix weight in search_algorithm.
It may be that all prefix matches are getting a score of 0, so if there are
no exact matches, htsearch can't figure out the maximum score correctly.
I'll look at the code to confirm this hunch, but do check your locale and
maybe try changing the prefix weight.

See http://www.htdig.org/attrs.html#locale
and http://www.htdig.org/attrs.html#search_algorithm

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
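
To make the locale hunch concrete, here is a minimal Python sketch of how a C-style number parser that accepts only the locale's decimal separator would read the weights in search_algorithm. It is not taken from the ht://Dig source: `parse_weight` and `parse_search_algorithm` are hypothetical helpers written for illustration, and whether htsearch actually parses the value this way is exactly what still needs to be confirmed in the code.

```python
def parse_weight(text, decimal_point="."):
    """Parse the longest leading number in `text`, strtod-style.

    Only `decimal_point` is accepted as the radix character, which mimics
    a C parser running under a locale with that decimal separator.
    Returns 0.0 when no digits are found.
    """
    chars = []
    seen_point = False
    for ch in text.strip():
        if ch in "0123456789":
            chars.append(ch)
        elif ch == decimal_point and not seen_point:
            chars.append(".")
            seen_point = True
        else:
            break  # stop at the first character that cannot extend the number
    number = "".join(chars)
    if not any(c in "0123456789" for c in number):
        return 0.0
    return float(number)


def parse_search_algorithm(value, decimal_point="."):
    """Split 'name:weight name:weight' into a dict of parsed weights."""
    weights = {}
    for item in value.split():
        name, _, weight = item.partition(":")
        weights[name] = parse_weight(weight, decimal_point)
    return weights


# Same configuration line, read under a period locale and a comma locale.
period_locale = parse_search_algorithm("exact:1 prefix:0.9", ".")
comma_locale = parse_search_algorithm("exact:1 prefix:0.9", ",")
# Writing the weight with a comma restores it under the comma locale.
comma_fixed = parse_search_algorithm("exact:1 prefix:0,9", ",")

print(period_locale)  # {'exact': 1.0, 'prefix': 0.9}
print(comma_locale)   # {'exact': 1.0, 'prefix': 0.0}
print(comma_fixed)    # {'exact': 1.0, 'prefix': 0.9}
```

Under these assumptions, a comma locale stops reading "0.9" at the period and the prefix weight collapses to 0, so every prefix-only match would score 0 no matter what the factor settings are. Writing the weight as "0,9" is the workaround suggested above.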

