On Mar 18, 2004, at 8:33 AM, Arcane Jill wrote:
This probably is going to sound like a really dumb question, but ... I'm curious. Why are characters being assigned codepoints > U+FFFF, when there is still loads and loads of unused empty space below that point. Is the BMP being saved for something? Are codepoints < U+010000 reserved for something of which I am unaware? If so, what? If not, why are assignments being made up there in the astral planes?
Check the roadmaps <http://www.unicode.org/roadmaps/>.
By my calculations, the total number of currently existent Unicode characters is < 0x10000, which means that
your calculations are way off. Unicode 4.0 has over 96,000 characters. The Han repertoire alone is larger than 65536. (See <http://www.unicode.org/versions/Unicode4.0.0/>.)
======== John H. Jenkins [EMAIL PROTECTED] [EMAIL PROTECTED] http://homepage..mac.com/jhjenkins/

