With my document shepherd hat on, I just reviewed draft-ietf-precis-mappings. I have sent some editorial comments to the authors. I also found two more substantive issues...

1. The "local case mapping" method specified in Section 2.3 talks about locale and context. However, the example in the second paragraph is a matter of language, not locale:

   As an example of locale and context-dependent mapping, LATIN CAPITAL
   LETTER I ("I", U+0049) is normally mapped to LATIN SMALL LETTER I
   ("i", U+0069); however, if the case of Turkish (or one of several
   other languages), unless an I is before a dot_above, the character
   should be mapped to LATIN SMALL LETTER DOTLESS I (U+0131).

As I understand it, locale (see Section 8 of RFC 6365) would refer to a particular region within a language-speaking community, such as Switzerland within the German-speaking areas.

The SpecialCasing.txt file in the Unicode standard talks about language-sensitive mappings for the Lithuanian, Turkish, and Azeri languages. (It also talks about a language-insensitive mapping, i.e., context-dependent mapping, for Greek final sigma.) It does not talk about locale-dependent mappings for particular regions within any language-speaking communities.

Therefore, I wonder if all mentions of locale in draft-ietf-precis-mappings really ought to be mentions of language. On reading the text in the document right now, I provisionally concluded that this switch would make sense, but I haven't thought carefully about every instance. And I would be curious to hear from the authors and working group about this issue.

2. Appendix B purports to describe why local case mapping needs to be an alternative to Unicode Default Case Mapping instead of being applied sequentially (the text mentions the possibility of applying local case mapping before Unicode Default Case Mapping - is that the only option, or should we say something about applying it after?).

However, Appendix B only mentions eszett (U+00DF) and to my mind does not provide a complete argument for why local case mapping needs to be an alternative to Unicode Default Case Mapping. At the least, it might be valuable to mention the handling of characters other than eszett. I suppose the basic argument is already in Section 2.3, but if so then I think that Appendix B might have a misleading title.

Other than these two issues, I think the document is in good shape (modulo some editorial adjustments).

Peter

--
Peter Saint-Andre
https://andyet.com/

_______________________________________________
precis mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/precis

Reply via email to