On 19 Jun 2013, at 13:41, Denis Jacquerye <[email protected]> wrote:

>> The same way one would rationalize using precomposed ãẽĩñõũỹ (aeinouy with 
>> tilde) but a necessarily de-composed g̃ (g with tilde) in Guaraní.
> 
> This is wrong: ãẽĩñõũỹ normalize to use U+0303 in NFD, so they 
> canonically use the same tilde as g̃.

Only in text which has been decomposed. Not all text gets decomposed. 

> The 4 additional non decomposable characters with Marshallese with cedilla 
> would not normalize to use the same cedilla as the others Marshallese 
> characters with cedilla. The would no canonically use the
> same cedilla.

That's correct. Why? Because the Latvians are using L+cedilla and N+cedilla, 
and we cannot change that. Mistakes were made in 1990 and 1991. We have to live 
with those mistakes. (Some mistakes we don't have t live with -- but this one 
we do.)

>>> It would require less new characters to be encoded and would make it easier 
>>> to support in fonts (adding 1 instead of 4).
>> 
>> No! Because if you added a single new character you'd have to make sure you 
>> had good glyph placement with LlMmNnOo which is eight glyphs.
> 
> The best practice would require to add diacritical mark placement whenever 
> necessary if not on all possible base character, M/m and O/o would still need 
> either way, L/l and N/n would need it for other
> combining diacritics either way.

In my fonts I do this with pre-composed glyphs. I don't really know how easy or 
how reliable attachment points for floating diacritics is. 

> A modern font already needs to be able to correctly place combining 
> diacritics, including cedilla or ogonek.
> Navajo and other languages need other placement of ogonek than that of 
> European languages.

I'd like to see evidence for that assertion. When I was in Lithuania last week 
I saw many examples of badly-placed ogonek, particularly on capital letters. 
What is your assertion about Navajo?

> This does not mean it is justified to encode single precomposed Navajo ogonek 
> characters.

But that isn't the issue here. 

> The placement of the cedilla is not semantically different, m̧ with the 
> cedilla on the left has the same meaning as if the cedilla were centered or 
> on the right, even if just one of the two is correct in
> some contexts like in Marshallese.

That isn't the issue here. The issue is that Marshallese uses a cedilla shape, 
but that despite the decomposition, Latvian letters with cedilla are drawn with 
commas below. So Marshallese gets the wrong glyph for L+cedilla and N+cedilla. 

> This does not mean it is justified to encode m with left cedilla, m with 
> centered cedilla or m with right cedilla.
> An additional single combining diacritics would behave the same way.

This isn't about diacritic positioning. It's about glyph shape. 

>>> ALA-LC romanizations use cedilla with r as they do under c or s.
>> 
>> Does ŗ contrast with r̦ in ALA-LC romanization?
> 
> The same way Marshallese has cedilla letters contrasting with comma below 
> letters.
> The only correct form is with cedilla and it doesn't use comma below.

All right: where does ŗ contrast with r̦ in ALA-LC romanization?

>> Do you think that encoding one new COMBINING MARSHALLESE CEDILLA will not 
>> cause problems both with existing COMBINING COMMA BELOW and COMBINING 
>> CEDILLA?
> 
> About the confusability, it is too late. Comma below, cedilla, palatalized 
> hook below, ring half ring below and probably others are already confusable. 
> Adding another will increase confusability but not
> to a relevant degree.

But there's nothing wrong with the current representation of Marshallese M̧ m̧ 
or O̧ o̧. Those are fine. 

> Having 4 single characters will not make anything less confusable (using 
> U+0327 with M/m and O/o but not with L/l and N/n is confusing) 

That's a different kind of confusion. It might be counter-intuitive to use 
COMBINING CEDILLA with M/m and O/o and pre-composed characters with L/l and 
N/n, but 

> although it is a solution is does not solve the general problem of cedilla.

We can't really "solve" the general problem. 

> If we don't want additional confusing characters maybe we should have CGJ, 
> ZWJ or ZWNJ + combining cedilla (or any other similar sequence) to optionally 
> differentiate the types of cedillas in Latvian,
> Livonian, Marshallese and romanizations.

We can't really use these with combining diacritical marks. 

> The issue of cedilla can easily be solved at a higher level, font 
> technologies like OpenType can easily display glyphs in Latvian or Livonia 
> and different glyphs for Marshallese.

Only in environments which permit language tagging. I'd like Marshallese 
children to be able to write their language in filenames. 

Michael Everson * http://www.evertype.com/



Reply via email to