Unicode TR25 (Unicode Support for Mathematics) specifies a lot of this, but it
is aimed more at the output of things like TeX or MathML than at programming
languages. It has things like the invisible comma (U+2063 INVISIBLE SEPARATOR),
so that a subscript can hold multiple indexes next to each other and the
underlying encoding can distinguish that from a subscript containing a single
multi-character identifier.
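
To make the distinction concrete, here is a rough Python sketch. The helper
name subscript_indices is made up for illustration, and splitting on the
separator is only my assumption about how a consumer might read such text,
not something TR25 prescribes:

```python
import unicodedata

# U+2063 INVISIBLE SEPARATOR, informally the "invisible comma"
INVISIBLE_SEPARATOR = "\u2063"


def subscript_indices(subscript: str) -> list[str]:
    """Split a subscript string into its separate indices (hypothetical helper)."""
    return subscript.split(INVISIBLE_SEPARATOR)


# Both strings render as "ij", but they are encoded differently.
two_indices = "i" + INVISIBLE_SEPARATOR + "j"  # indexes i and j
one_identifier = "ij"                          # one identifier named "ij"

print(unicodedata.name(INVISIBLE_SEPARATOR))
print(subscript_indices(two_indices))
print(subscript_indices(one_identifier))
```

The two strings look the same on screen, but only the first one splits into
two indices.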
