On May 1, 2007, at 12:19 PM, Jim Jewett wrote:

> On 5/1/07, "Martin v. Löwis" <[EMAIL PROTECTED]> wrote:
>
>> The identifier syntax is <ID_Start> <ID_Continue>\*.
>
>> ID_Start is defined as all characters having one of the general
>> categories uppercase letters (Lu), lowercase letters (Ll), titlecase
>> letters (Lt), modifier letters (Lm), other letters (Lo), letter
>> numbers (Nl), plus the underscore (XXX what are "stability extensions
>> listed in UAX 31).
>
> Are you sure that modifier letters should be included?  The standard
> says so, but as nearly as I can tell, these are really more like
> diacritics -- and some of them look an awful lot like punctuation.
>
>     http://unicode.org/charts/PDF/U02B0.pdf

The entire point of these characters is that they are to be treated  
as letters (that is, can make up part of a word). If they were  
punctuation or diacritics, the other very-similar-looking characters  
in other parts of the codespace could be used. These letters seem to  
be mainly intended for spelling out phonetic pronunciations. It's  
unlikely that anyone would want to write an python identifier in IPA,  
but that's not a good reason to go against the standard.

James
_______________________________________________
Python-3000 mailing list
Python-3000@python.org
http://mail.python.org/mailman/listinfo/python-3000
Unsubscribe: 
http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com

Reply via email to