On 18/04/2026 12:23, [email protected] via Unicode wrote:
The SEW subsequently explained that the actual reason is due to
insufficient evidence of user community that would need to use the
resulting mapping. Despite Win32 being a highly popular platform with
plenty of backwards compatibility and native UCS-2 terminal support,
the specific use cases of installing codepages into Windows NT and
using terminal tiles from Windows 3.1/95/98/ME are not sufficiently
documented, making it difficult for any user communities to form
around it. So it seems like the idea of standardizing legacy Arabic
terminal BMP mappings is a dead end for now.
---
In case it's useful, here's a list of the Unicode mapping tables from
the ICU-TC for DOS, AIX or EBCDIC codepages containing at least one of
the non‑Unicode Arabic Presentation Forms in question, and the various
Unicode mappings (which variously may or may not be to the Private Use
Area, and which variously may or may not round-trip) which they use for
those characters. The codepage charts are also linked for reference.
Code page 420:
https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00420.pdf
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-420_X120-1999.ucm
◦ AS010000(0x77) → U+0633
◦ AS230000(0x80) → U+0634
◦ AS450000(0x8B) → U+0635
◦ AD450000(0x8D) → U+0636
Code page 421:
https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00421.pdf
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-4517_P100-2005.ucm
◦ AS010000(0x77) → U+FEB1
◦ AS230000(0x80) → U+FEB5
◦ AS450000(0x8B) → U+FEB9
◦ AD450000(0x8D) → U+FEBD
Code page 864:
https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00864.pdf
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-864_X110-1999.ucm
◦ AS010000(0xBC) → U+FEB1
◦ AS230000(0xBD) → U+FEB5
◦ AS450000(0xBE) → U+FEB9
◦ AD450000(0xEB) → U+FEBD
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-4960_P100-1995.ucm
◦ AS010000(0xBC) → U+F8F6
◦ AS230000(0xBD) → U+F8F5
◦ AS450000(0xBE) → U+F8F4
◦ AD450000(0xEB) → U+F8F7
Code page 868:
https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00868.pdf
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-868_P100-1995.ucm
◦ AA010006(0x90) → U+F8FB
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-868_X100-1995.ucm
◦ AA010006(0x90) → U+FE8E
Code page 918:
https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00918.pdf
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-918_P100-1995.ucm
◦ AA010006(0x48) → U+F8FB
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-918_X100-1995.ucm
◦ AA010006(0x48) → U+FE8E
Code page 1006:
https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01006.pdf
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1006_P100-1995.ucm
◦ AA010006(0xB2) → U+F8FB
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1006_X100-1995.ucm
◦ AA010006(0xB2) → U+FE8E
Code page 1008:
https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01008.pdf
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1008_P100-1995.ucm
◦ AS010000(0xD0) → U+F8F6
◦ AS230000(0xD2) → U+F8F5
◦ AS450000(0xD4) → U+F8F4
◦ AD450000(0xD6) → U+F8F7
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1008_X110-1999.ucm
◦ AS010000(0xD0) → U+FEB1
◦ AS230000(0xD2) → U+FEB5
◦ AS450000(0xD4) → U+FEB9
◦ AD450000(0xD6) → U+FEBD
Code page 1046:
https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01046.pdf
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1046_X110-1999.ucm
◦ AS010000(0x83) → U+FEB1
◦ AS230000(0x84) → U+FEB5
◦ AS450000(0x85) → U+FEB9
◦ AD450000(0x86) → U+FEBD
◦ AA210006(0xA1) → U+FE82
◦ AA310006(0xA2) → U+FE84
◦ AA310406(0xA3) → U+FE88
◦ AA010006(0xA5) → U+FE8E
Code page 1097:
https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01097.pdf
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1097_P100-1995.ucm
◦ AA210006(0x46) → U+F8FA
◦ AA010006(0x49) → U+F8FB
◦ AA310006(0x54) → U+F8F9
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1097_X100-1995.ucm
◦ AA210006(0x46) → U+FE82
◦ AA010006(0x49) → U+FE8E
◦ AA310006(0x54) → U+FE84
Code page 1098:
https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01098.pdf
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1098_P100-1995.ucm
◦ AA210006(0x88) → U+F8FA
◦ AA010006(0x8B) → U+F8FB
◦ AA310006(0x8F) → U+F8F9
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1098_X100-1995.ucm
◦ AA210006(0x88) → U+FE82
◦ AA010006(0x8B) → U+FE8E
◦ AA310006(0x8F) → U+FE84
Code page 1127:
https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01127.pdf
•
https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1127_P100-2004.ucm
◦ AS010000(0x9C) → U+F8F6
◦ AS230000(0x9D) → U+F8F5
◦ AS450000(0x9E) → U+F8F4
◦ AD450000(0x9F) → U+F8F7
◦ AA210006(0xA1) → U+F8FA
◦ AA310006(0xA2) → U+F8F9
◦ AA310406(0xA3) → U+F8F8
◦ AA010006(0xA5) → U+F8FB
—Har.