Re: [I18n] Default charset for locale (is UNICODE !)

Ričardas Čepas Thu, 26 Oct 2000 11:26:59 -0700

On Thu Oct 26 20:18:25 2000 +0300 Alexander Voropay wrote:

> Hi!
> 
>  As it is known, a lot of languages (locales) have more than one
> "encodings" (charsets).
> ru_RU.KOI8-R
> ru.RU.ISO8859-5
> ru.RU.CP-1251
> ja_JP.ISO2022-JP
> ja_JP.Shift-JIS
> ja_JP.EUC-JP
> 
>  Otherwise, locale name defined as :
>  
>    language[.TERRITORY[.Codeset]]
> 
>  So, there are "short" locale names where we'll lost "Charset" information.
> ru
> ru_RU
> ja
> ja_JP
> 
>  I think, "short" locale names should be _aliases_to_UNICODE UTF-8 
> as Universal (default) Charset.
> 
> ru --> ru_RU --> ru_RU.UTF-8
> ja --> ja_JP  --> ja_JP.UTF-8

You may add 'lt' here as well as many others for completeness.

> 
> 
>  Any comments ?
> 

        Me too ;)  After all if you want to use one of many legacy charsets you should 
give some information which one you want to use (I now about nl_langinfo(CODESET) but 
this doesn't always work).

-- 
      ☻ Ričardas Čepas ☺
~~
~
-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/

Re: [I18n] Default charset for locale (is UNICODE !)

Reply via email to