Andrew C. West <andrewcwest at alumni dot princeton dot edu> wrote: >> An invalid GB18030 sequence, like <FE 40>, or a valid but out-of- >> range sequence, like <E3 32 9A 36>, should be treated just like an >> invalid or out-of-range UTF-8 sequence. Issue an error message, >> format the hard disk, whatever; just don't try to treat it like a >> normal character. > > Hmm, surely <FE 40> is a valid GB-18030 sequence = U+FA0C according to > my reckoning (although Word fails to correctly convert <FE 40> when > told to open a file as GB-18030, it does save U+FA0C as <FE 40> when > told to save as GB-18030).
Oops, sorry. I goofed in my example. Substitute <FE 20> or something similar. -Doug Ewell Fullerton, California http://users.adelphia.net/~dewell/

