On 18/04/2026 12:23, [email protected] via Unicode wrote:

The SEW subsequently explained that the actual reason is due to insufficient evidence of user community that would need to use the resulting mapping. Despite Win32 being a highly popular platform with plenty of backwards compatibility and native UCS-2 terminal support, the specific use cases of installing codepages into Windows NT and using terminal tiles from Windows 3.1/95/98/ME are not sufficiently documented, making it difficult for any user communities to form around it. So it seems like the idea of standardizing legacy Arabic terminal BMP mappings is a dead end for now.


---


In case it's useful, here's a list of the Unicode mapping tables from the ICU-TC for DOS, AIX or EBCDIC codepages containing at least one of the non‑Unicode Arabic Presentation Forms in question, and the various Unicode mappings (which variously may or may not be to the Private Use Area, and which variously may or may not round-trip) which they use for those characters.  The codepage charts are also linked for reference.


Code page 420: https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00420.pdf

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-420_X120-1999.ucm
  ◦ AS010000(0x77) → U+0633
  ◦ AS230000(0x80) → U+0634
  ◦ AS450000(0x8B) → U+0635
  ◦ AD450000(0x8D) → U+0636


Code page 421: https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00421.pdf

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-4517_P100-2005.ucm
  ◦ AS010000(0x77) → U+FEB1
  ◦ AS230000(0x80) → U+FEB5
  ◦ AS450000(0x8B) → U+FEB9
  ◦ AD450000(0x8D) → U+FEBD


Code page 864: https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00864.pdf

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-864_X110-1999.ucm
  ◦ AS010000(0xBC) → U+FEB1
  ◦ AS230000(0xBD) → U+FEB5
  ◦ AS450000(0xBE) → U+FEB9
  ◦ AD450000(0xEB) → U+FEBD

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-4960_P100-1995.ucm
  ◦ AS010000(0xBC) → U+F8F6
  ◦ AS230000(0xBD) → U+F8F5
  ◦ AS450000(0xBE) → U+F8F4
  ◦ AD450000(0xEB) → U+F8F7


Code page 868: https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00868.pdf

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-868_P100-1995.ucm
  ◦ AA010006(0x90) → U+F8FB

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-868_X100-1995.ucm
  ◦ AA010006(0x90) → U+FE8E


Code page 918: https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00918.pdf

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-918_P100-1995.ucm
  ◦ AA010006(0x48) → U+F8FB

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-918_X100-1995.ucm
  ◦ AA010006(0x48) → U+FE8E


Code page 1006: https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01006.pdf

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1006_P100-1995.ucm
  ◦ AA010006(0xB2) → U+F8FB

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1006_X100-1995.ucm
  ◦ AA010006(0xB2) → U+FE8E


Code page 1008: https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01008.pdf

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1008_P100-1995.ucm
  ◦ AS010000(0xD0) → U+F8F6
  ◦ AS230000(0xD2) → U+F8F5
  ◦ AS450000(0xD4) → U+F8F4
  ◦ AD450000(0xD6) → U+F8F7

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1008_X110-1999.ucm
  ◦ AS010000(0xD0) → U+FEB1
  ◦ AS230000(0xD2) → U+FEB5
  ◦ AS450000(0xD4) → U+FEB9
  ◦ AD450000(0xD6) → U+FEBD


Code page 1046: https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01046.pdf

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1046_X110-1999.ucm
  ◦ AS010000(0x83) → U+FEB1
  ◦ AS230000(0x84) → U+FEB5
  ◦ AS450000(0x85) → U+FEB9
  ◦ AD450000(0x86) → U+FEBD
  ◦ AA210006(0xA1) → U+FE82
  ◦ AA310006(0xA2) → U+FE84
  ◦ AA310406(0xA3) → U+FE88
  ◦ AA010006(0xA5) → U+FE8E


Code page 1097: https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01097.pdf

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1097_P100-1995.ucm
  ◦ AA210006(0x46) → U+F8FA
  ◦ AA010006(0x49) → U+F8FB
  ◦ AA310006(0x54) → U+F8F9

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1097_X100-1995.ucm
  ◦ AA210006(0x46) → U+FE82
  ◦ AA010006(0x49) → U+FE8E
  ◦ AA310006(0x54) → U+FE84


Code page 1098: https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01098.pdf

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1098_P100-1995.ucm
  ◦ AA210006(0x88) → U+F8FA
  ◦ AA010006(0x8B) → U+F8FB
  ◦ AA310006(0x8F) → U+F8F9

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1098_X100-1995.ucm
  ◦ AA210006(0x88) → U+FE82
  ◦ AA010006(0x8B) → U+FE8E
  ◦ AA310006(0x8F) → U+FE84


Code page 1127: https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01127.pdf

https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-1127_P100-2004.ucm
  ◦ AS010000(0x9C) → U+F8F6
  ◦ AS230000(0x9D) → U+F8F5
  ◦ AS450000(0x9E) → U+F8F4
  ◦ AD450000(0x9F) → U+F8F7
  ◦ AA210006(0xA1) → U+F8FA
  ◦ AA310006(0xA2) → U+F8F9
  ◦ AA310406(0xA3) → U+F8F8
  ◦ AA010006(0xA5) → U+F8FB



—Har.


Reply via email to