S02 Names - Alphabetic?

Helmut Wollmersdorfer Mon, 04 May 2009 11:09:41 -0700

http://perlcabal.org/syn/S02.html


----quote-----
Names

[...]

* An identifier is composed of an alphabetic character followed byany sequence of alphanumeric characters. The definitions of alphabeticand numeric include appropriate Unicode characters. Underscore is alwaysconsidered alphabetic. An identifier may also contain isolatedapostrophes or hyphens provided the next character is alphabetic.

----end of quote----

At a first view 'alphabetic ... include appropriate Unicode characters'sounds like 'characters with Unicode property Alphabetic'.

There are some 'Alphabetic' characters not working in rakudo, and somenot working in STD.pm (via evalbot), e.g.


ा U+093E DEVANAGARI VOWEL SIGN AA
   (Alphabetic, Spacing_Mark, ID_Continue=Yes)
   rakudo: FAIL, std: PASS
ⓐ U+24D0 CIRCLED LATIN SMALL LETTER A
   (Alphabetic, Other_Symbol, ID_Continue=No)
   rakudo: FAIL, std: FAIL

Wouldn't it be easier to reference the Unicode properties

1) ID_Start plus U+005F LOW LINE (=Underscore)
2) ID_Continue

for identifiers? That's what Unicode 'ID_x' is for.
With the nice 'side effect' that combining diacritics are in ID_Continue.

Helmut Wollmersdorfer

S02 Names - Alphabetic?

Reply via email to