On May 1, 2007, at 12:19 PM, Jim Jewett wrote: > On 5/1/07, "Martin v. Löwis" <[EMAIL PROTECTED]> wrote: > >> The identifier syntax is <ID_Start> <ID_Continue>\*. > >> ID_Start is defined as all characters having one of the general >> categories uppercase letters (Lu), lowercase letters (Ll), titlecase >> letters (Lt), modifier letters (Lm), other letters (Lo), letter >> numbers (Nl), plus the underscore (XXX what are "stability extensions >> listed in UAX 31). > > Are you sure that modifier letters should be included? The standard > says so, but as nearly as I can tell, these are really more like > diacritics -- and some of them look an awful lot like punctuation. > > http://unicode.org/charts/PDF/U02B0.pdf
The entire point of these characters is that they are to be treated as letters (that is, can make up part of a word). If they were punctuation or diacritics, the other very-similar-looking characters in other parts of the codespace could be used. These letters seem to be mainly intended for spelling out phonetic pronunciations. It's unlikely that anyone would want to write an python identifier in IPA, but that's not a good reason to go against the standard. James _______________________________________________ Python-3000 mailing list Python-3000@python.org http://mail.python.org/mailman/listinfo/python-3000 Unsubscribe: http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com