At 7:05 PM +0200 9/18/01, Quim Sanmarti wrote:
>The same can be said about Spanish, French, Catalan w.r.t. English (AFAIK)
>and Russian and lots of other languages (I guess); That's why IMHO the
>language-related aspects of searching should allow a high degree of
>flexibility.
And certainly other fuzzy algorithms are needed for different
languages. Soundex and metaphone work OK on somewhat related
languages, but realistically different phonetic mappings would be
needed. For languages like Russian where multiple character encodings
can also be used (IIRC), even the "accents" fuzzy algorithm isn't
going to be very useful.
On the other hand, while I could probably work out a French version
of metaphone and maybe Italian or Spanish, that doesn't cover many
languages. Suggestions (and code) are always welcome.
If I were to suggest a configuration technique for chaining fuzzy
algorithms, it would be an enhancement to search_algorithm, e.g.:
search_algorithm: exact:1.0 endings:0.3 synonyms:0.3 synonyms->endings:0.1
In any case, as I said before, while this is probably a useful
enhancement, it won't be implemented anytime soon unless someone is
willing to provide code. There are only so many hours in the day,
unfortunately.
--
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html