I am looking for advice from the Unicode community.

I am working within the Finnish NB on a proposal for additional characters
used to write the Komi-Permyak and Komi-Zyrian  languages in Latin script
in the 1930s (1932-1937 in Komi-Permyak (Latin alone) and 1932-1935 years
in Komi-Zyrian publications). Prior to this period a modified Cyrillic
alphabet (Molodcov, for which supplementary characters are encoded in the
range U+0500–050F) was used by both the Komi-Zyrians and Komi-Permyaks (in
1936 the Komi-Zyrians completely reverted to Molodcov, which some
Komi-Zyrian publishers had retained throughout). By late 1938 both
Komi-Permyak and Komi-Zyrian orthographies became closely aligned with the
Russian Cyrillic system with only two supplementary characters). The
previously used, Old Permic characters are encoded in the range

Komi belongs to the Uralic (Finno-Ugric) family of languages, related to
Finnish. It is spoken in the Republic of Komi, a member state of the
Russian Federation and the Permski Krai.

The additional Latin characters to be proposed include Latin capital and
small letters C, D, L, S, T and  ɜ with descenders. They also include a
number of Cyrillic letters, capital and small Ukrainian IE (in Komi a hard
affricate CHA) and Soft Sign (in Komi a high central unrounded vowel), used
together with Latin letters.  Could/should these (four) be encoded as Latin
characters (which would clearly add to confusables) or how could the mix of
scripts be best handled?

Sincerely, Jack Rueter

Ph.D., Language Researcher at University of Helsinki

Reply via email to