RE: Localized names of character ranges

Marco Cimarosti Tue, 03 Dec 2002 02:27:55 -0800

Mark Davis wrote:
> While not a trivial task (about 400 terms), it is many, many 
> times easier than translating all the significant character
> names. That might someday be worth considering for the
> Common XML Locale Repository
> (http://oss.software.ibm.com/icu/locale/).


The problem is not the number of terms involved (400 strings is not a big
deal: it corresponds to a small localization project), but rather the utter
idiosyncrasy of Unicode-related terminology.

Terms such as "title case", "caseless", "reordering" or "combining" are
nearly impossible to translate satisfactorily in other languages, and even
simple terms such as "character", "letter" or "ideograph", can be tricky, if
they have to remain distinguished from each other.

I tried myself to translate the Unicode glossary in Italian, but I still
have to find satisfactory translations for several entries, although I
knocked to experts of disciplines ranging from typography (I still don't
have a valid equivalent for the "case" in "case sensitive", etc.) to
Hebraism (I am still odds with "cantillation marks").

IMHO, if such an effort is really worth doing, it should be organized and
promoted by the Unicode Consortium itself, rather than in an OEM library
like ICU.

I suggest that the 400-odd property and value name be listed in a text file
on the Unicode FTP site (with each English term well commented and
explained) and translations be collected on a voluntary basis has was done
for the "What's Unicode?" text. The copyright on this material should grant
free and unrestricted usage to any implementation such as ICU.

_ Marco

RE: Localized names of character ranges

Reply via email to