Re: Umlauts et al.

Joe Blaylock Tue, 16 Nov 2010 21:33:15 +0100

> Alternatively, we can try to be more fancy and attempt some
> language-specific analysis and treatment, so depending on the language
> of the document and/or of the field used, we would do various stuff to
> the text.


Would something like Unidecode help?
http://www.tablix.org/~avian/blog/archives/2009/01/unicode_transliteration_in_python/

We could update author records to have several transliterations as
alternate names.

Joe

Re: Umlauts et al.

Reply via email to