Brandon Fosdick wrote:
Joe Orton wrote:

The 𐀀 character will be passed through in its four byte UTF-8 form (which is 0xf0 0x90 0x80 0x80 I think)

FYI - 65536 isn't a valid ucs-2 character; it is, however, a valid ucs-4
character.

That might be part of the origin of your issues; try 65535 as a MAX_VAL
for ucs-2 (which would be a three-byte utf-8 value).

65536 cannot be mapped to a three-byte utf-8 sequence (it needs four
bytes), and in utf-16 it can only be mapped as a four byte sequence
(a surrogate pair).
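
A quick way to check the byte sequences for the two boundary values is
to let a codec do the work. This is only a sketch in Python using its
built-in utf-8 and utf-16 codecs; the helper name encodings() is made up
for illustration and is not part of any code discussed in this thread.

```python
def encodings(code_point):
    """Return (utf8_bytes, utf16_be_bytes) for a single code point."""
    ch = chr(code_point)
    # Big-endian utf-16 without a byte order mark, so only the code
    # units themselves are shown.
    return ch.encode("utf-8"), ch.encode("utf-16-be")


def as_hex(data):
    """Format bytes as space-separated hex pairs, e.g. 'ef bf bf'."""
    return " ".join(f"{b:02x}" for b in data)


if __name__ == "__main__":
    for cp in (65535, 65536):
        utf8, utf16 = encodings(cp)
        print(f"{cp}: utf-8 = {as_hex(utf8)} ({len(utf8)} bytes), "
              f"utf-16 = {as_hex(utf16)} ({len(utf16)} bytes)")
```

Under these assumptions 65535 comes out as three utf-8 bytes and two
utf-16 bytes, while 65536 needs four bytes in both encodings.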

Bill
