On Wednesday, Jul 23, 2003, at 23:20 Asia/Tokyo, Kino wrote:
3. Terminal showed
MacArabic "\x20" does not map to Unicode. MacArabic "\x21" does not map to Unicode. MacArabic "\x22" does not map to Unicode. ...
I noticed that the characters \x20-\x2F, \x3A-\x3F, \x5B-\x5F, \x7B-\x7D, \x81, \x8C, \x93, \x98, \x9B, \xA0-\xA4, \xA6-\xAB, \xAD-\xBA, \xBC-\xBE, \xC0, \xDB-\xDF, \xFB-\xFD are those which have <LR>+ or <RL>+ before Unicode value in
<ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/APPLE/ARABIC.TXT>
in the following manner:
0x20 <LR>+0x0020 # SPACE, left-right 0x21 <LR>+0x0021 # EXCLAMATION MARK, left-right ... 0x8C <RL>+0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK, right-left ...
<LR> and <RL> make sense only in legacy WorldScript technology and I think 0x20, 0x21 ... 0x8C, etc. should be treated as 0x0020, 0x0021 ... 0x00AB, etc. just like those in Windows Arabic CP1256.
0x20 0x0020 #SPACE 0x21 0x0021 #EXCLAMATION MARK ... 0xAB 0x00AB #LEFT-POINTING DOUBLE ANGLE QUOTATION MARK ...
Ideally directional marks would be inserted in right places, but I don't know well how they should work. U+202C, U+202D and U+202E seem to work in TextEdit though.
Kino