* William A. Rowe, Jr. wrote:

> Brandon Fosdick wrote:
> > Joe Orton wrote:
> >>The 𐀀 character will be passed through in its four byte UTF-8
> >>form (which is 0xf4 0x80 0x80 0x80 I think)
>
> FYI - 65536 isn't a valid ucs-2 character; it is, however, a valid ucs-4
> character.
>
> That might be part of the origin of your issues, try 65535 as a MAX_VAL
> for ucs-2 (which would be a three-byte utf-8 value.)
>
> 65536 cannot be mapped to utf-8, but it can be mapped as a four byte
> utf-16 sequence.

Sure, it can. The utf-8 sequence is "\xf0\x90\x80\x80".

nd
-- 
On Usenet, wnorg is always spelled wrnog. If you don't spell wrgon
wnrog, that is simply worgn. On the other hand, you may happily spell
rgiht rihgt, because a wonrg spelling of rhigt is not considered wrgno.
                                          -- Hajo Pflüger in dnq
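
For anyone who wants to check the byte values discussed in this thread, here is a minimal sketch in Python. The helper names utf8_encode and utf16_surrogates are made up for illustration and are not part of any library mentioned here; the bit layout follows the usual UTF-8 and UTF-16 definitions. It suggests that 65536 (U+10000) encodes to f0 90 80 80, that 65535 takes three bytes, and that the f4 80 80 80 sequence guessed earlier decodes to U+100000 rather than U+10000.

```python
def utf8_encode(cp: int) -> bytes:
    """Hand-rolled UTF-8 encoder for a single code point (illustrative helper)."""
    if cp < 0 or cp > 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("not a Unicode scalar value: %#x" % cp)
    if cp < 0x80:                      # 1 byte:  0xxxxxxx
        return bytes([cp])
    if cp < 0x800:                     # 2 bytes: 110xxxxx 10xxxxxx
        return bytes([0xC0 | (cp >> 6),
                      0x80 | (cp & 0x3F)])
    if cp < 0x10000:                   # 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    # 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return bytes([0xF0 | (cp >> 18),
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


def utf16_surrogates(cp: int) -> tuple:
    """Split a code point above U+FFFF into a UTF-16 surrogate pair."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("code point does not need a surrogate pair")
    v = cp - 0x10000
    return (0xD800 | (v >> 10), 0xDC00 | (v & 0x3FF))


if __name__ == "__main__":
    print(utf8_encode(65536).hex(" "))            # four-byte form of U+10000
    print(utf8_encode(65535).hex(" "))            # three-byte form of U+FFFF
    print(hex(ord(b"\xf4\x80\x80\x80".decode("utf-8"))))  # what f4 80 80 80 really is
    print([hex(u) for u in utf16_surrogates(65536)])      # UTF-16 surrogate pair
```

The results can be cross-checked against Python's built-in codecs, e.g. chr(65536).encode("utf-8").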
