All,
I am investigating differing behavior in various environments of the 
wide-character version of the C function isAlpha with respect to 
character U+30FC KATAKANA-HIRAGANA PROLONGED SOUND MARK. Some 
implementations indicate that it is alphabetic, some don't. I 
suspect that other characters might be subject to the same confusion.

The UNICODE documents seem abiguous on this point: the General 
Catetory is "Lm" which, although informative instead of normative, 
would seem to imply that it is alphabetic; likewise 
DerivedCoreProperties-4.0.0.txt indicates that it is alphabetic; but 
PropList-4.0.0.txt contains two records - one indicating that it is 
a diacritic, one that indicates it is an extender.

On to my questions:

Q1: Can a character be both alphabetic and diacritic?

Q2: Is there a difinitive answer as to whether this is an alphabetic 
character?

Thanks in advance for answers to these questions and/or any 
additional isight you can provide.

Regards,
Rob Mount








Reply via email to