[Serge]
> def search_key(s):
>     de_str = unicodedata.normalize("NFD", s)
>     return ''.join(cp for cp in de_str if not
>                    unicodedata.category(cp).startswith('M'))

Lovely bit of code - thanks for posting it!

You might want to use "NFKD" to normalize things like LATIN SMALL
LIGATURE FI and subscript/superscript characters as well as diacritics.

-- 
Richie Hindle
[EMAIL PROTECTED]
-- 
http://mail.python.org/mailman/listinfo/python-list

Reply via email to