Dan Bron wrote:
> Also, I think I once saw a Unicode spec laying out all the characters the
> consortium considered appropriate to compose programming language
> identifiers. Am I misremembering? If not, can someone point me at it,
> and if so, I'd like suggestions on useful word-formatting rules for
> Unicode characters.

Dan:

I think the Unicode data you want is

http://www.unicode.org/Public/UNIDATA/DerivedCoreProperties.txt

If you search for the derived properties ID_Start and ID_Continue, you get lists of code points for appropriate characters.

What kind of word-formatting do you want to do?

Best wishes,

John
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm
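
In case it helps with searching that file: below is a rough Python sketch of how the ID_Start and ID_Continue entries could be pulled out and used to test a candidate identifier. It assumes the usual layout of DerivedCoreProperties.txt (a code point or `first..last` range, a semicolon, the property name, then a `#` comment). The function names and the short SAMPLE excerpt are made up for illustration; the real file is far longer.

```python
# Illustrative excerpt in the layout of DerivedCoreProperties.txt.
# This is a hand-trimmed sample, not the complete data.
SAMPLE = """\
0041..005A    ; ID_Start # L&  [26] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER Z
0061..007A    ; ID_Start # L&  [26] LATIN SMALL LETTER A..LATIN SMALL LETTER Z
00AA          ; ID_Start # Lo       FEMININE ORDINAL INDICATOR
0030..0039    ; ID_Continue # Nd  [10] DIGIT ZERO..DIGIT NINE
0041..005A    ; ID_Continue # L&  [26] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER Z
005F          ; ID_Continue # Pc       LOW LINE
0061..007A    ; ID_Continue # L&  [26] LATIN SMALL LETTER A..LATIN SMALL LETTER Z
"""


def parse_derived_properties(lines, wanted=("ID_Start", "ID_Continue")):
    """Return {property name: [(first, last), ...]} for the wanted properties."""
    ranges = {name: [] for name in wanted}
    for line in lines:
        data = line.split("#", 1)[0].strip()  # drop the trailing comment
        if not data:
            continue  # blank or comment-only line
        fields = [field.strip() for field in data.split(";")]
        if len(fields) < 2 or fields[1] not in ranges:
            continue  # some other property
        first, _, last = fields[0].partition("..")
        lo = int(first, 16)
        hi = int(last, 16) if last else lo  # single code point
        ranges[fields[1]].append((lo, hi))
    return ranges


def has_property(ranges, name, ch):
    """True if character ch falls in any range recorded for the property."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in ranges[name])


def is_identifier(ranges, word):
    """First character must be ID_Start, the rest ID_Continue."""
    return (
        bool(word)
        and has_property(ranges, "ID_Start", word[0])
        and all(has_property(ranges, "ID_Continue", c) for c in word[1:])
    )


ranges = parse_derived_properties(SAMPLE.splitlines())
```

With a downloaded copy of the file, the same parser should work on the open file object, e.g. `parse_derived_properties(open("DerivedCoreProperties.txt", encoding="utf-8"))`. Note that under these properties the underscore is ID_Continue but not ID_Start, so a language that allows a leading underscore would need to add it separately.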
