On Mar 18, 2004, at 8:33 AM, Arcane Jill wrote:


This is probably going to sound like a really dumb question, but ... I'm curious. Why are characters being assigned codepoints > U+FFFF, when there are still loads and loads of unused empty space below that point? Is the BMP being saved for something? Are codepoints < U+010000 reserved for something of which I am unaware? If so, what? If not, why are assignments being made up there in the astral planes?


Check the roadmaps <http://www.unicode.org/roadmaps/>.


By my calculations, the total number of currently existent Unicode characters is < 0x10000, which means that

your calculations are way off. Unicode 4.0 has over 96,000 characters. The Han repertoire alone is larger than 65536. (See <http://www.unicode.org/versions/Unicode4.0.0/>.)
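
A rough check of the arithmetic behind that last point, as a minimal sketch rather than anything from the original message. The block ranges below are assumed to be the CJK Unified Ideographs assignments as of Unicode 4.0 (the main block plus Extensions A and B), and the variable names are illustrative only.

```python
# Sketch: compare the size of the BMP with the Han repertoire.
# Assumption: these are the assigned ranges as of Unicode 4.0.
BMP_SIZE = 0x10000  # code points U+0000..U+FFFF

han_ranges = {
    "CJK Unified Ideographs": (0x4E00, 0x9FA5),
    "Extension A": (0x3400, 0x4DB5),
    "Extension B": (0x20000, 0x2A6D6),  # already outside the BMP
}

# Each range is inclusive, so its size is last - first + 1.
han_total = sum(last - first + 1 for first, last in han_ranges.values())

print(f"BMP code points:      {BMP_SIZE}")
print(f"Han ideographs:       {han_total}")
print(f"Han fits in the BMP?  {han_total <= BMP_SIZE}")
```

Under those assumptions the unified ideographs alone outnumber the 65,536 code points of the BMP, before counting any other script, the surrogate range, or the private use area.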


========
John H. Jenkins
[EMAIL PROTECTED]
http://homepage.mac.com/jhjenkins/



