Peter Constable asked:

> U+0027 APOSTROPHE has a general category of Po; U+02BC MODIFIER LETTER
> APOSTROPHE has a general category of Lm. I haven't checked how they
> compare with regard to any other properties. I'm wondering what kinds of
> text processes might be expected to distinguish between these (i.e. give
> different results / behaviours for the two characters).

Well, for starters: isLetter() and isIdentifier() should give different results. U+02BC should be part of identifiers by default -- it is part of the alphabet of some languages. On the other hand, U+0027 is very often a syntax character, used as a 'quote' mark to indicate delimitation of an identifier or other symbol.

--Ken
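
As a rough illustration of the distinction described above, here is a minimal Python sketch. It is only an approximation: Python's str.isalpha() and str.isidentifier() stand in for the isLetter() and isIdentifier() mentioned in the reply, and that mapping is an assumption rather than anything the thread specifies. The helper names describe() and is_identifier_with(), and the made-up name "ab?cd", are hypothetical and exist only for this example.

```python
import unicodedata

APOSTROPHE = "\u0027"           # U+0027 APOSTROPHE
MODIFIER_APOSTROPHE = "\u02BC"  # U+02BC MODIFIER LETTER APOSTROPHE


def describe(ch):
    """Return (general category, letter-like?) for a single character.

    str.isalpha() is used here as a stand-in for isLetter(); it is true
    for characters whose general category is one of the letter categories.
    """
    return unicodedata.category(ch), ch.isalpha()


def is_identifier_with(ch):
    """Check whether a made-up name containing ch is a valid Python identifier.

    "ab" + ch + "cd" is a hypothetical name used only for this test.
    """
    return ("ab" + ch + "cd").isidentifier()


for ch in (APOSTROPHE, MODIFIER_APOSTROPHE):
    category, letter_like = describe(ch)
    print(
        f"U+{ord(ch):04X}",
        unicodedata.name(ch),
        category,
        letter_like,
        is_identifier_with(ch),
    )
```

On a standard CPython 3 build this should report Po for U+0027, with both checks false, and Lm for U+02BC, with both checks true. Other languages and libraries may draw the identifier boundary differently, so treat this as one example of a text process that distinguishes the two characters, not as a statement of what every implementation does.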

