Regional Indicator Symbols

Doug Ewell Sat, 02 Jun 2012 15:11:36 -0700

Disclaimer: This post does not propose or encourage any new mechanismsfor Unicode.


Jean-François Colson wrote:

Here is why I don’t like n3680.

N3680 is moot. The Regional Indicator Symbols at U+1F1E6 throughU+1F1FF, based on N3727, were encoded in Unicode 6.0 instead.

ISO-3166-1 defines two-letter symbols for many “pieces of earth” which
are not independent countries.
For example, there’s an ISO-3166-1 symbol for Réunion, an overseas
department of France.
Why would Unicode define a flag for Réunion (RE, also known as FR-RE
in ISO-3166-2) but not for Puy-de-Dôme (FR-63, a metropolitan
department) or Bretagne (Brittany, FR-E, a metropolitan region which
has its own flag:
http://upload.wikimedia.org/wikipedia/commons/7/7d/Flag_of_Brittany.svg)?


Unicode doesn't define any flags.

To answer your underlying question about why N3680 (which was notadopted) defined a symbol for Réunion: because N3680 was based strictlyon which entities were assigned a code element in ISO 3166-1. Everyindividual and organization that tries to second-guess ISO 3166/MA, andmake their own judgments as to what is and isn't a country, runs intothe same problems that the MA has already dealt with. It's almost neverblack and white. The best thing to do in 99.9% of cases is to follow ISO3166-1, and point to it when someone criticizes.

If those flags are used for languages, Brittany has its own language:
Breton (Brezhoneg). So, a support for ISO-3166-2 could be a good Idea.

Flags of nations should never be used to denote languages. Therelationship is not even close to 1-to-1. There are better ways.

Also, the flag of Québec (CA-QC,
http://upload.wikimedia.org/wikipedia/commons/5/5f/Flag_of_Quebec.svg)
could be used to make a difference between European and Canadian
French.

There is a whole wide, wonderful world surrounding the use of standardcodes to identify languages. It's called "language tagging" and if youare interested, read IETF BCP 47 (RFC 5646 and 4647) or visit theietf-languages mailing list.

The short answer is that French generically is "fr", French as spoken inFrance is "fr-FR", Canadian French is "fr-CA", and if you can manage topin down exactly what "European French" means, that would be "fr-150".

Using this mechanism, the best way to identify (for example) CanadianFrench content is with the Basic Latin letters "fr-CA" — NOT with animage of a Canadian flag or any other flag.

Unicode is supposed to be a stable standard: any new character will
remain always and forever.
ISO-3166 is not stable enough to be encoded as it is in Unicode:
countries may change their names, countries may merge together,
countried may be divided into several parts.
In any case, new two-letter codes may be created, existing ones may
become obsolete and, most problematic, when a two-letter code has been
obsolete for at least 5 years, IT MAY BE USED FOR ANOTHER COUNTRY.

ISO 3166-1 was updated several years ago to change this period to 50years.

For these reasons, I think the system proposed by Philippe Verdy would
be better suited to encode flags: the codes would have no meaning by
themselves, but their combinations would be displayed as flags,
whenever possible, by the rendering engine.

Read http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3727.pdf . This mechanism ispart of Unicode, and adding another one such as Philippe's to accomplishbasically the same thing would be a form of duplicate encoding.


--
Doug Ewell | Thornton, Colorado, USA
http://www.ewellic.org | @DougEwell

Regional Indicator Symbols

Reply via email to