Dan Bron wrote:

> Also, I think I once saw a Unicode spec laying out all the characters the
> consortium considered appropriate to compose programming language
> identifiers.  Am I misremembering?  If not, can someone point me at it,
> and if so, I'd like suggestions on useful word-formatting rules for
> Unicode characters.

Dan:

I think the Unicode data you want is

http://www.unicode.org/Public/UNIDATA/DerivedCoreProperties.txt

If you search for the derived properties ID_Start and ID_Continue you get
lists of code points for appropriate characters.

What kind of word-formatting do you want to do?

Best wishes,

John

----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Reply via email to