* [EMAIL PROTECTED]: Why not store codepoint numbers as bignums?
You can. If you use the UTF-32/UCS-4 encoding of Unicode, every code point takes exactly four bytes, which is effectively what you're suggesting. UTF-16, UTF-8, and UTF-7 are variable-length encodings, so the number of bytes per code point differs from one character to the next, but the option of using a fixed-length encoding is there. --Lars M.
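To make the fixed-length versus variable-length point concrete, here is a minimal Python sketch. The helper name `encoded_sizes` and the sample characters are my own choices for illustration, not anything from the thread. It uses the big-endian codec variants so that no byte-order mark is added to the counts.

```python
def encoded_sizes(text):
    """Return the number of bytes `text` occupies in three Unicode encodings.

    Hypothetical helper for illustration only. The "-be" codecs are used so
    that Python does not prepend a byte-order mark, which would inflate the
    counts.
    """
    return {
        "utf-32": len(text.encode("utf-32-be")),
        "utf-16": len(text.encode("utf-16-be")),
        "utf-8": len(text.encode("utf-8")),
    }


if __name__ == "__main__":
    # One ASCII letter, one Latin-1 letter, one other BMP character, and one
    # character outside the Basic Multilingual Plane (U+1D11E).
    samples = ["A", "\u00e9", "\u20ac", "\U0001D11E"]
    for ch in samples:
        print(f"U+{ord(ch):04X}", encoded_sizes(ch))
```

In this sketch the UTF-32 size stays at four bytes for every sample, while the UTF-16 and UTF-8 sizes depend on the code point.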

