Patrick Andries scripsit:
> Small question again.
> 
> Why is U+17C1 KHMER VOWEL SIGN E of General category Mc (Mark, Spacing
> Combining) while similar signs in Lao and Thai, related scripts, are of
> General category Lo (Letter, Other)?
> 
> See U+0E40 THAI CHARACTER SARA E and U+0EC0 LAO VOWEL SIGN E, I believe 
> these signs are also placed on the left of the consonant affected.

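Thai (and Lao, whose encoding closely parallels that of Thai) is
encoded in Unicode on a principle of its own:  straight left-to-right,
typewriter-style visual order.  A vowel written to the left of its
consonant is therefore stored before that consonant as an ordinary
spacing character rather than after it as a combining mark, which is
why it is Lo and not Mc.  This was done for compatibility with the
pervasive Thai 8-bit standard.  It also means that for collation purposes
what are historically left-side vowels must be moved after the following
consonant.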
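
For anyone who wants to poke at this, here is a rough Python sketch, not
a real collation implementation.  It prints the General categories of the
three characters in question and then swaps a Thai or Lao left-side vowel
with the consonant that follows it.  The names reorder_for_collation and
is_consonant are my own, and the consonant ranges are a simplifying
assumption; a real collator does a good deal more than this.

```python
import unicodedata

# Thai U+0E40..U+0E44 and Lao U+0EC0..U+0EC4: left-side vowels that are
# stored in visual order, i.e. before the consonant they belong to.
PREPOSED_VOWELS = (
    {chr(cp) for cp in range(0x0E40, 0x0E45)}
    | {chr(cp) for cp in range(0x0EC0, 0x0EC5)}
)


def is_consonant(ch):
    """Rough test (an assumption): the Thai and Lao consonant blocks."""
    cp = ord(ch)
    return 0x0E01 <= cp <= 0x0E2E or 0x0E81 <= cp <= 0x0EAE


def reorder_for_collation(text):
    """Move each left-side vowel after the consonant that follows it."""
    chars = list(text)
    i = 0
    while i < len(chars) - 1:
        if chars[i] in PREPOSED_VOWELS and is_consonant(chars[i + 1]):
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
            i += 2  # skip past the pair just swapped
        else:
            i += 1
    return "".join(chars)


if __name__ == "__main__":
    for cp in (0x17C1, 0x0E40, 0x0EC0):
        ch = chr(cp)
        print(f"U+{cp:04X} {unicodedata.name(ch)}: {unicodedata.category(ch)}")

    # SARA E + KO KAI in storage order becomes KO KAI + SARA E.
    stored = "\u0E40\u0E01"
    print([f"U+{ord(c):04X}" for c in reorder_for_collation(stored)])
```

Khmer text passes through unchanged, since U+17C1 already follows its
consonant in storage.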

Note that the Thai characters are not labeled LETTER or VOWEL SIGN or
what have you, but simply CHARACTER.

-- 
Only do what only you can do.               John Cowan <[EMAIL PROTECTED]>
  --Edsger W. Dijkstra's advice             http://www.reutershealth.com
    to a student in search of a thesis      http://www.ccil.org/~cowan
