Re: Character name translations

Asmus Freytag Thu, 20 Dec 2012 16:53:57 -0800

On 12/20/2012 2:36 PM, Jukka K. Korpela wrote:

2012-12-20 14:13, David Starner wrote:
It may be useful to try to agree on official or semi-official names for
characters in a language. Such a list hardly needs to cover all ofthe over
100,000 Unicode characters.
Why not? Why should an English speaker sticking a arbitrary character
into a character map program get a name for it but a non-English
speaker not?
For most characters, a “translated” name would be arbitrary. I wouldcompare this to names of biological species. Most species lack namesin most languages, and when names exist, they are often vaguely andinconsistently used.

But when real people, not biologists, want to look up information theyhave precisely two choices: they can look at a visual index (for speciesthat can be arranged visually) or they can look up the scientific namefor the species based on the only thing they know: the local popular name.

That’s why people use scientific (Linnaean) names. We use common namesfor common animals, but it just would not make sense to assign a nameto the millions of insect species in each human language. Thescientific name is a crucial key to information. With Unicodecharacters, both the number and the name act as such keys, though thename is usually descriptive of meaning, too.

Unlike species, all characters for living scripts have popular localnames in at least one language other than English.

It may not be desirable to blindly translate ALL such names into ALLlanguages, but major languages (not only English) may be used by peoplethat are familiar with or study many other languages and scripts. Forthose languages, their community of scholars represents another set ofusers who benefit from translated names.

Finally, for arcane scripts, there's usually an easily translatable partof the character name (think of LATIN LETTER SMALL) and an arbitrarypart of the name (e.g. A) which comes from a transliteration scheme, acatalog number or the like.

If a language doesn't have a unique transliteration scheme for aparticular script, the choices are to either use the same as present inthe Unicode Standard, or to use one from another, culturally morerelevant language (e.g. a French-based instead of and English-basedtransliteration).

So Unicode names should not be translated at all, any more than you
translate General Category values for example.


Why wouldn't you?


Because those values are identifiers.

No, names have multiple uses; especially if you take the formal name asone in a series of "aliases" for each character - that's why it's oftenmore useful to think of translations of the full code charts andcharacter index, instead of "just" the formal names. (The latter, bythemselves are not so useful).

There's an argument that they're generally useful
for programmers only and programming often requires English knowledge,
but if I were explaining the character categories in Esperanto, I
would certainly say that Sm is matematikaj simboloj or Simbolo
Matematika, not act like "Symbol, Math" should have any importance to
my audience.
We can and often should *explain* meanings of identifiers in differentlanguages, but that’s different from naming things. The value “Sm” hasa technical meaning, and it is not identical with the common-languageexpression “mathematical symbol” or its variants, though rather close.

The linguistic content of the short labels is indeed limited, however, Ican see good reasons to provide alternate abbreviations for characters,e.g. for ZWSP or WJ, because these terms are used in places where theydo not act as identifiers.

A./

Re: Character name translations

Reply via email to