Quoting Chris Leonard (cjlhomeaddr...@gmail.com):

These examples are indeed the most common exceptions. Let's add my
experience about them.

> 1)
> en, en_US and en_GB are commonly found because
> a) there are a number of distinct spelling differences (orthography)
> that you do not see with Spanish.
> b) the burden of maintaining the "translations" is actually quite minimal.
> In Gnome, there is even a PERL script that takes a first pass at this
> for you, with pretty good results.

The main problem here is deciding about the *original*
strings. Depending on the maintainer of the software, they're either
en_US, en_GB or, most often en_airport (the same English I'm writing
right now...:-)).

Standardi{z|s}ing on one of both (not to mention en_AU, en_ZA, etc.)
is always a good idea, but requires a quite good knowledge of English,
most often available to native speakers only.

In Debian, we try to standardize on en_US when we can (indeed, when we
have someone reviewing the texts). Not because it's better or
whatever, but mostly because it seems to be the most widely used, so
that requires less modifications. We eve have a debian-l10n-english
mailing list aimed at encouraging eviews of texts, with a few
volunteers trying to help package maitnainers, documentation writers,
our publicity team, etc. to get good texts out.

And, nearly always we don't have any en_GB translation. Probably
because the often very technical texts we have do not require this.


> 2)
> zh_CN and zh-TW are in fact quite different, you never see just zh (in
> my experience).   When present, both zh_CN and zh-TW typically get
> well maintained.  Occasionally one will also see zh_HK, but it is less
> common.

Those both are indeed different variations in *written* forms of the
Chinese language*s* (Simplified, used mostly in mainland China and
Traditional used in Taiwan, Hong-Kong, Singapore, etc.). Actually we
should use zh@simplified and zh@traditional (modifiers) rather than zh_CN for
Simplified and zh_TW for Traditional (country variants).

Using country variants instead of modifiers for Chinese is common
practice (we used it in Debian) but actually bad practice as it leaves
people using zh_GK or zh_SG out of the game (they have to add zh_TW as
alternative in their locale settings).


> 
> 3)
> pt_PT and pt_BR are very frequently found.  I don't know the
> distinctions well, but when present, they both get well maintained.

Yet another difference established by common practice. Indeed,
Brazilian is different enough from Portuguese to nearly warrant its
own ISO code, which would make things better.

Most software use pt.po and pt_BR.po files. A mistake would be using
pt_BR and pt_PT as "continental" Portuguese is also used in former
Portuguese colonies and there are locales for some of them.


> 
> 4)
> Occasionally one will see de_DE and de_CH, although much more rarely,
> and generally with much less completeness on de_CH

I've seen very few of these and I often try to discourage them. From
what I heard of German and Swiss fellows, the difference is more is
spoken language than written one.

The same stands for French, where the written language is standardized
over French-speaking countries with few enough variants (the most well
known is the way to say 7x, 8x, 9x numbers, between France and
Belgium/switzerland).


> 
> 5)
> fa and fa_AF seems to be an important distinction as well (Iranian
> Persian vs Afghani Dari).

I have not enough experience about these. In Debian we have a few
Persian translations and all of them us "fa".


Another common variant case is bn_IN and bn_BD. I'm having very hard
times understanding if that's for real reasons of for political
reasons. Unfortunately, politics often enters such things and
everything related to languages and countries becomes sensitive one
day or another.

My last story about this are the two variants of Serbian : ekavian and
ijekavian. I still remember a meeting at last Debian conference (in
Banja Luka, Republika srpska, part of federation of Bosnia and
Herzegovina) where I was dropped into something that was looking like
Dayton negotiations in the 90's, between Serbs from Serbia and Serbs
from Republika Srpska. Both wanted to do their own work and, believe
me, you really don't want to be in the middle of this..:-)

As a result, we now have "sr.po" translations for Serbian"ekavian"
(the form used in Serbia) and sr@ijekavian for "ijekavian" (the form
used in the Serb part of Bosnia). My proposal to use sr_BS was
completely ruled out by locals....for political reasons (even though
they are part of Bosnia, they don't feel like this). My proposal to
use "bs" (Bosnian) which is also existing was also ruled out even
though "bs" is actually exactly s...@ijekavian....but only used by
people in the "muslim" part of Bosnia. The conclusion of this meeting
was "Welcome to the Balkans".

Interesting game, isn't it?



------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Translate-pootle mailing list
Translate-pootle@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/translate-pootle

Reply via email to