Peter Constable asked:

> U+0027 APOSTROPHE  has a general category of Po; U+02BC MODIFIER LETTER 
> APOSTROPHE has a general category of Lm. I haven't checked how they 
> compare with regard to any other properties. I'm wondering what kinds of 
> text processes might be expected to distinguish between these (i.e. give 
> different results / behaviours for the two characters).

Well, for starters: isLetter() and isIdentifier() should give different
results. U+02BC should be part of identifiers by default -- it is part
of the alphabet of some languages. On the other hand, U+0027 is very
often a syntax character, used as a 'quote' mark to indicate delimitation
of an identifier or other symbol.

--Ken


Reply via email to