> Arabic needs tagging of glyphs as being `initial', `medial', `final', > and `isolated', as specified in the Unicode book. Since this is > identical for all fonts the OpenType designers have decided to make > this information not being part of the font itself.
I just had to struggle with this a little. The ARABIC LETTER HEH (U+0647) is a letter with 4 glyph forms. In Kurdish (written in the Sorani, essential arabic, alphabet) one has two letters (let me call them Kurdish H and Kurdish E) and these 4 glyph forms become the two forms of Kurdish H and the two forms of Kurdish E. Now these four glyphs are tagged with `initial', `medial', `final', and `isolated', and that is correct if the glyphs are used to write arabic, but incorrect when precisely the same glyphs are used to write Kurdish. I wonder what the correct way is to write Kurdish in Unicode (without using language tagging). Are new Unicode code points needed? Do these exist already? Andries -- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/
