Followup to:  <[EMAIL PROTECTED]>
By author:    [EMAIL PROTECTED]
In newsgroup: linux.utf8
> 
> > Character Set Encoding of Tags:
> > ===============================
> >
> > UTF-8 is the default encoding for tag data.  Unfortunately UTF-8 muffed
> > it for Asian languages by doing the equivalent of giving the same
> > character codes to English, Russian, and Greek letters.  
> 
> It's interesting that Japanese and Chinese, which are unrelated languages,
> are sometimes mutually understandable when written, but somehow use totally
> different scripts.
> 

Also, there would hardly have been any hurt feelings if U+0041
(Latin), U+0391 (Greek), and U+0410 (Cyrillic) had been unified.  It
would just not have saved enough code points to bother.
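
For anyone who wants to see the three look-alike capitals side by
side, here is a small Python sketch using the standard unicodedata
module.  The names LOOKALIKES and describe are mine, purely for
illustration; the point is only that Latin A (U+0041), Greek Alpha,
and Cyrillic A are three separate code points.

```python
import unicodedata

# Three visually similar capital letters that Unicode encodes separately
# (illustrative list, not part of any standard API).
LOOKALIKES = ["\u0041", "\u0391", "\u0410"]

def describe(ch):
    """Return 'U+XXXX NAME' for a single character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"

descriptions = [describe(ch) for ch in LOOKALIKES]
for line in descriptions:
    print(line)
```

Each letter reports a different script in its name, which is exactly
the non-unification being discussed.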

As far as "English" and "Russian" are concerned, the various
Latin-script languages, and likewise the various Cyrillic-script
languages, have each shared one set of letter codes for a long,
long time.

Things like U+212A (KELVIN SIGN) and U+212B (ANGSTROM SIGN) should
never have been allowed to happen, on the other hand, IMNSHO.
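
To illustrate why those two are a nuisance, here is a rough Python
sketch (the helper name nfc is my own): U+212A and U+212B look like
the ordinary letters K and A-with-ring but compare unequal to them
until the text is normalized.  Canonical normalization (NFC here)
folds them back onto U+004B and U+00C5.

```python
import unicodedata

KELVIN_SIGN = "\u212a"    # looks like Latin capital K
ANGSTROM_SIGN = "\u212b"  # looks like Latin capital A with ring above

def nfc(s):
    """Normalize a string to Unicode Normalization Form C."""
    return unicodedata.normalize("NFC", s)

# Raw comparison fails even though the glyphs look the same.
print(KELVIN_SIGN == "K")

# After NFC the duplicates collapse onto the ordinary letters.
print(nfc(KELVIN_SIGN) == "K")
print(nfc(ANGSTROM_SIGN) == "\u00c5")
```

So any code that compares or searches such text has to normalize
first, which would not be necessary had the duplicates not been
encoded.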

        -hpa
-- 
<[EMAIL PROTECTED]> at work, <[EMAIL PROTECTED]> in private!
"Unix gives you enough rope to shoot yourself in the foot."
http://www.zytor.com/~hpa/puzzle.txt    <[EMAIL PROTECTED]>
--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/
