Re: asking advice of the Unicode community on new character proposal

2019-05-03 Thread Richard Wordingham via Unicode
On Fri, 3 May 2019 11:01:33 +0300
Jack Rueter via Unicode  wrote:

> The additional Latin characters to be proposed include Latin capital
> and small letters C, D, L, S, T and  ɜ with descenders. They also
> include a number of Cyrillic letters, capital and small Ukrainian IE
> (in Komi a hard affricate CHA) and Soft Sign (in Komi a high central
> unrounded vowel), used together with Latin letters.  Could/should
> these (four) be encoded as Latin characters (which would clearly add
> to confusables) or how could the mix of scripts be best handled?

The latter pair may already be encoded as U+0184/U+0185 LATIN
CAPITAL/SMALL LETTER TONE 6, which was once intended to use the glyph of
the Cyrillic soft sign.

The Ukrainian IE used in Latin script is certainly eligible, by the
principle of separation of scripts. The only challenge I can see would
be a claim that it was already encoded as U+0190/U+025B LATIN
CAPITAL/SMALL LETTER OPEN E.

Richard.



asking advice of the Unicode community on new character proposal

2019-05-03 Thread Jack Rueter via Unicode
Hello!



I am looking for advice from the Unicode community.



I am working within the Finnish NB on a proposal for additional characters
used to write the Komi-Permyak and Komi-Zyrian  languages in Latin script
in the 1930s (1932-1937 in Komi-Permyak (Latin alone) and 1932-1935 years
in Komi-Zyrian publications). Prior to this period a modified Cyrillic
alphabet (Molodcov, for which supplementary characters are encoded in the
range U+0500–050F) was used by both the Komi-Zyrians and Komi-Permyaks (in
1936 the Komi-Zyrians completely reverted to Molodcov, which some
Komi-Zyrian publishers had retained throughout). By late 1938 both
Komi-Permyak and Komi-Zyrian orthographies became closely aligned with the
Russian Cyrillic system with only two supplementary characters). The
previously used, Old Permic characters are encoded in the range
U+10350–1037F.



Komi belongs to the Uralic (Finno-Ugric) family of languages, related to
Finnish. It is spoken in the Republic of Komi, a member state of the
Russian Federation and the Permski Krai.



The additional Latin characters to be proposed include Latin capital and
small letters C, D, L, S, T and  ɜ with descenders. They also include a
number of Cyrillic letters, capital and small Ukrainian IE (in Komi a hard
affricate CHA) and Soft Sign (in Komi a high central unrounded vowel), used
together with Latin letters.  Could/should these (four) be encoded as Latin
characters (which would clearly add to confusables) or how could the mix of
scripts be best handled?



Sincerely, Jack Rueter

Ph.D., Language Researcher at University of Helsinki