AFAIK, there is no authoritative mapping of human languages to either character encodings (sets of glyphs) or sovereign states (aka countries). There are authoritative, language-neutral definitions for each of these: ISO codes for languages and countries and IANA "character set names".

The only thing I have found along these lines in the mapping Microsoft uses in it's own character set conversion libraries. Have a look at http://msdn.microsoft.com/library/en-us/intl/nls_9l2r.asp

Personally, I'd like to see this mapping a) tranlated to use ISO codes and IANA names, b) in XML format and c) approved by the IETF.

IANA Character Sets
http://www.iana.org/assignments/character-sets

Google Advanced Search is very helpful in finding the following. Search for documents in the desired language w/ "ISO 639" or "ISO 3166".

Country Names (ISO 3166)

English: http://www.davros.org/misc/iso3166.html
French: http://www.din.de/gremien/nas/nabd/iso3166ma/codlstp1/fr_lstp1.html
German: http://mdz2.bib-bvb.de/hist/info/tools/countrycode.html

Language Names (ISO 639)

English: http://ftp.ics.uci.edu/pub/ietf/http/related/iso639.txt
French: http://www.termisti.refer.org/iso639.htm
German: http://www.allegro-c.de/allegro/formate/sprachen.htm

hth,
Charles Reitzel



At 03:29 PM 1/30/2003 -0500, Peter VanDijck wrote:
Anyone know where I can find a list of languages, in English and also in
those other languages. Also countries where they are spoken and other
info if available. And preferably in some kind of machine readable
format....
Peter
--
http://cms-list.org/
more signal, less noise.
--
http://cms-list.org/
more signal, less noise.


Reply via email to