> One of the advantages of using Python 2 for parsing is that it can work
> with a complete 32-bit Unicode charset encoding (UTF-8), rather than
> just a locale-specific subset, and includes support for transforming
> many (most) subsets into UTF-8.
My understanding is that you need the catalogs and NLS support built
into Python to take advantage of that, and that means ensuring that the
package maintainer (or if you do source builds on your own) did not use the
--disable-nls switch when compiling. Many do (and there's good reason to).
/d