J Decker wrote:

> I generally accepted any utf-8 encoding up to 31 bits though ( since
> I was going from the original spec, and not what was effective limit
> based on unicode codepoint space)

Hey, everybody: Don't do that. UTF-8 has been constrained to the Unicode code space (maximum U+10FFFF, four bytes) for almost fourteen years now.

--
Doug Ewell | Thornton, CO, US | ewellic.org
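
For anyone wondering what "constrained to the Unicode code space" might look like in a decoder, here is a minimal Python sketch. It is only an illustration, not anyone's actual implementation: the names `decode_utf8_strict` and `is_strict_utf8` are hypothetical, and the assumption is that the decoder should reject the old 5- and 6-byte forms, anything above U+10FFFF, overlong encodings, and surrogate code points.

```python
def decode_utf8_strict(data: bytes) -> list[int]:
    """Decode UTF-8 into code points, accepting only U+0000..U+10FFFF.

    Hypothetical helper for illustration; raises ValueError on bad input.
    """
    code_points = []
    i = 0
    n = len(data)
    while i < n:
        lead = data[i]
        if lead < 0x80:
            length, cp, minimum = 1, lead, 0x0
        elif 0xC2 <= lead <= 0xDF:
            length, cp, minimum = 2, lead & 0x1F, 0x80
        elif 0xE0 <= lead <= 0xEF:
            length, cp, minimum = 3, lead & 0x0F, 0x800
        elif 0xF0 <= lead <= 0xF4:
            length, cp, minimum = 4, lead & 0x07, 0x10000
        else:
            # 0x80..0xBF: stray continuation byte
            # 0xC0, 0xC1: can only start an overlong 2-byte form
            # 0xF5..0xFF: beyond U+10FFFF, including the old 5/6-byte leads
            raise ValueError(f"invalid lead byte 0x{lead:02X} at offset {i}")

        if i + length > n:
            raise ValueError(f"truncated sequence at offset {i}")

        for j in range(1, length):
            b = data[i + j]
            if b & 0xC0 != 0x80:
                raise ValueError(f"invalid continuation byte at offset {i + j}")
            cp = (cp << 6) | (b & 0x3F)

        if cp < minimum:
            raise ValueError(f"overlong encoding at offset {i}")
        if cp > 0x10FFFF:
            # Reachable with lead byte 0xF4 followed by 0x90..0xBF.
            raise ValueError(f"code point above U+10FFFF at offset {i}")
        if 0xD800 <= cp <= 0xDFFF:
            raise ValueError(f"surrogate code point at offset {i}")

        code_points.append(cp)
        i += length
    return code_points


def is_strict_utf8(data: bytes) -> bool:
    """Return True if data decodes under the rules above (hypothetical helper)."""
    try:
        decode_utf8_strict(data)
    except ValueError:
        return False
    return True
```

With this sketch, `b"\xf4\x8f\xbf\xbf"` (U+10FFFF) is accepted, while `b"\xf4\x90\x80\x80"` (which would be U+110000) and any sequence starting with `0xF8` or higher are rejected. In practice Python's built-in `bytes.decode("utf-8")` already applies the same limits, so a hand-written decoder like this is only useful for seeing where the checks go.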