I'd like to add some words...

On Wednesday, Jul 23, 2003, at 23:20 Asia/Tokyo, Kino wrote:

3. Terminal showed

MacArabic "\x20" does not map to Unicode.
MacArabic "\x21" does not map to Unicode.
MacArabic "\x22" does not map to Unicode.
...

I noticed that the characters \x20-\x2F, \x3A-\x3F, \x5B-\x5F, \x7B-\x7D, \x81, \x8C, \x93, \x98, \x9B, \xA0-\xA4, \xA6-\xAB, \xAD-\xBA, \xBC-\xBE, \xC0, \xDB-\xDF, \xFB-\xFD are those which have <LR>+ or <RL>+ before Unicode value in


<ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/APPLE/ARABIC.TXT>

in the following manner:

0x20    <LR>+0x0020       # SPACE, left-right
0x21    <LR>+0x0021       # EXCLAMATION MARK, left-right
...
0x8C    <RL>+0x00AB       # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK, right-left
...

<LR> and <RL> make sense only in legacy WorldScript technology and I think 0x20, 0x21 ... 0x8C, etc. should be treated as 0x0020, 0x0021 ... 0x00AB, etc. just like those in Windows Arabic CP1256.

0x20    0x0020  #SPACE
0x21    0x0021  #EXCLAMATION MARK
...
0xAB    0x00AB  #LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
...

Ideally directional marks would be inserted in right places, but I don't know well how they should work. U+202C, U+202D and U+202E seem to work in TextEdit though.


Kino




Reply via email to