holes (unassigned code points) in the code charts

Stephan Stiller Fri, 04 Jan 2013 02:56:35 -0800

All,

There are plenty of unassigned code points within blocks that are inuse; these often come at the end of a block but there are plenty ofholes as well.


I have a cluster of interrelated questions:

1. What sorts of reasons are there (or have there been) for leavingholes? Code page conversion and changes to casing by simple arithmetic?What else?1.1 The rationale for particular holes is not documented in the codecharts I looked at; is there documentation? (Yes, in some instances theanswer can be guessed.)1.2 How is the number of holes determined? It seems like multiples of 16are used for block sizes merely for practical reasons.2. I notice that ranges are often used to describe where scripts arefound. Do holes have properties? Are the other block-related policiesthat gives holes a certain semantics?2.1 If not, how likely is it that Unicode assigns script-externalcharacters to holes?2.2 If yes, how does the number of assigned code points differ, if holesthat are assumed to be filled only by certain types of characters arecounted?2.2.1 Would this make much of a difference wrt the question (this comesup from time to time it seems) of how much of Unicode will eventuallyfill up?

3. Have there been "mistakes" wrt to hole assignment?

Stephan

holes (unassigned code points) in the code charts

Reply via email to