[precis] shepherd review of draft-ietf-precis-mappings

Peter Saint-Andre - &yet Thu, 11 Jun 2015 16:05:59 -0700

With my document shepherd hat on, I just revieweddraft-ietf-precis-mappings. I have sent some editorial comments to theauthors. I also found two more substantive issues...

1. The "local case mapping" method specified in Section 2.3 talks aboutlocale and context. However, the example in the second paragraph is amatter of language, not locale:


   As an example of locale and context-dependent mapping, LATIN CAPITAL
   LETTER I ("I", U+0049) is normally mapped to LATIN SMALL LETTER I
   ("i", U+0069); however, if the case of Turkish (or one of several
   other languages), unless an I is before a dot_above, the character
   should be mapped to LATIN SMALL LETTER DOTLESS I (U+0131).

As I understand it, locale (see Section 8 of RFC 6365) would refer to aparticular region within a language-speaking community, such asSwitzerland within the German-speaking areas.

The SpecialCasing.txt file in the Unicode standard talks aboutlanguage-sensitive mappings for the Lithuanian, Turkish, and Azerilanguages. (It also talks about a language-insensitive mapping, i.e.,context-dependent mapping, for Greek final sigma.) It does not talkabout locale-dependent mappings for particular regions within anylanguage-speaking communities.

Therefore, I wonder if all mentions of locale indraft-ietf-precis-mappings really ought to be mentions of language. Onreading the text in the document right now, I provisionally concludedthat this switch would make sense, but I haven't thought carefully aboutevery instance. And I would be curious to hear from the authors andworking group about this issue.

2. Appendix B purports to describe why local case mapping needs to be analternative to Unicode Default Case Mapping instead of being appliedsequentially (the text mentions the possibility of applying local casemapping before Unicode Default Case Mapping - is that the only option,or should we say something about applying it after?).

However, Appendix B only mentions eszett (U+00DF) and to my mind doesnot provide a complete argument for why local case mapping needs to bean alternative to Unicode Default Case Mapping. At the least, it mightbe valuable to mention the handling of characters other than eszett. Isuppose the basic argument is already in Section 2.3, but if so then Ithink that Appendix B might have a misleading title.

Other than these two issues, I think the document is in good shape(modulo some editorial adjustments).


Peter

--
Peter Saint-Andre
https://andyet.com/

_______________________________________________
precis mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/precis

[precis] shepherd review of draft-ietf-precis-mappings

Reply via email to