At 7:05 PM +0200 9/18/01, Quim Sanmarti wrote:
>The same can be said about Spanish, French, Catalan w.r.t. English (AFAIK)
>and Russian and lots of other languages (I guess); That's why IMHO the
>language-related aspects of searching should allow a high degree of
>flexibility.

And certainly other fuzzy algorithms are needed for different 
languages. Soundex and metaphone work OK on somewhat related 
languages, but realistically different phonetic mappings would be 
needed. For languages like Russian where multiple character encodings 
can also be used (IIRC), even the "accents" fuzzy algorithm isn't 
going to be very useful.

On the other hand, while I could probably work out a French version 
of metaphone and maybe Italian or Spanish, that doesn't cover many 
languages. Suggestions (and code) are always welcome.

If I were to suggest a configuration technique for chaining fuzzy 
algorithms, it would be an enhancement to search_algorithm, e.g.:

search_algorithm: exact:1.0 endings:0.3 synonyms:0.3 synonyms->endings:0.1

In any case, as I said before, while this is probably a useful 
enhancement, it won't be implemented anytime soon unless someone is 
willing to provide code. There are only so many hours in the day, 
unfortunately.

-- 
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to