Of course, a default grapheme cluster is exactly what Johann was trying to represent in 64 bits in his Excessive Memory Usage Encoding. It's unfortunate that 64 bits just isn't enough for this purpose.
It would be a whole lot easier if Unicode types would only use the same words for things as the rest of the world. I suggest:
(1) A codepoint is still called a codepoint. No problem there.
(2) The object currently called a "character" be renamed as something like "mapped codepoint" or "encoded codepoint", or possibly (coming in from the other end) something like "sub-character" or "character component" or "characterette" (which can be shortened to "charette" and pronounced "carrot". :-) )
(3) The object currently called a "default grapheme cluster" be renamed as "character".
(4) The object currently called a "tailored grapheme cluster" be renamed as "tailored character"
This would make even /our/ conversations a lot less confusing. Jill

