>>>>> On Mon, 06 Jun 2016, Mart Raudsepp wrote: > Usually only two letter language codes suffice, but can be limited with > country codes with a 'll_CC' formatting, where 'll' is the language code > and 'CC' is the country code, e.g en_GB. Some rare languages also have > three letter language codes.
s/country code/territory code/g Question related to this, do we take the opportunity to standardise the values? Looks like the vast majority follows language[_territory][@modifier] specified by POSIX [1] but some don't. Also there are a few duplicates, like sr@Latn / sr@latin and uz@Cyrl / uz@cyrillic. I suggest that we adhere to the BCP 47 [2] names if possible (which would be Latn and Cyrl for the examples mentioned). Ulrich [1] http://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap08.html#tag_08_02 [2] http://www.rfc-editor.org/rfc/bcp/bcp47.txt
pgpB_tZMQVJkR.pgp
Description: PGP signature
