I think verdy_p's message below is not very clear. I think I know what he means, but the message itself needs some clarification.
In a message dated 11/5/2003 3:39:09 AM Pacific Standard Time, [EMAIL PROTECTED] writes:
From: "Abdij Bhat" <[EMAIL PROTECTED]>
I think you should say "UTF-8 sequences will not use the C0 control code area (0x00-0x1F) to represent other characters" instead of "UTF-8 sequences will not contain any C0 control bytes," because it is legal to have C0 control codes inside UTF-8. For example, TAB, CR, and LF are all in the C0 area and are perfectly legal in UTF-8.
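To illustrate the point about C0 controls, here is a minimal Python sketch. The helper name `c0_bytes_in_multibyte` and the sample string are my own assumptions for illustration, not anything from the thread. It checks that TAB, CR, and LF each encode to the same single byte in UTF-8, and that no multi-byte sequence in the sample text contains a byte below 0x20.

```python
# C0 controls (U+0000..U+001F) encode to one identical byte in UTF-8,
# so they are legal inside a UTF-8 stream.
for cp in (0x09, 0x0D, 0x0A):  # TAB, CR, LF
    assert chr(cp).encode("utf-8") == bytes([cp])


def c0_bytes_in_multibyte(text: str) -> bool:
    """Return True if any multi-byte UTF-8 sequence in text holds a byte < 0x20.

    Hypothetical helper for illustration only.
    """
    for ch in text:
        seq = ch.encode("utf-8")
        if len(seq) > 1 and any(b < 0x20 for b in seq):
            return True
    return False


# Sample text mixing C0 controls with 2-, 3-, and 4-byte characters.
sample = "a\tb\r\n\u00e9\u4e2d\U0001F600"
found = c0_bytes_in_multibyte(sample)
```

For the sample above, `found` should be `False`: the C0 byte values appear only as the controls themselves, never inside the encoding of another character.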
but it will in many
I think the right way to say it is: "UTF-8 may use bytes 0x80 to 0x9F as part of a multi-byte sequence for a single Unicode character, and those byte values are defined as the C1 control area. Therefore, a code sequence that uses the raw bytes 0x80 to 0x9F should not be inserted into a UTF-8 stream, but could be inserted into a UTF-16 stream (by using the two-byte code units 0x0080 - 0x009F)."
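A small Python sketch of the C1 situation follows. The helper `is_valid_utf8` and the choice of CSI (U+009B) as the example control are assumptions made for illustration. It shows that a C1 control becomes a two-byte pair in UTF-8, that an ordinary character such as U+00C0 uses a byte from the 0x80-0x9F range as its trail byte, and that a raw 0x9B byte on its own is not well-formed UTF-8.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes as well-formed UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


csi = "\u009b"                      # C1 control CSI, chosen as an example
csi_utf8 = csi.encode("utf-8")      # two bytes: 0xC2 0x9B
csi_utf16 = csi.encode("utf-16-be")  # one 16-bit code unit: 0x00 0x9B

# U+00C0 encodes as 0xC3 0x80, so 0x80 shows up as a trail byte
# of a character that has nothing to do with C1 controls.
a_grave_utf8 = "\u00c0".encode("utf-8")

raw_c1_escape = b"\x9b[0m"          # raw C1 byte used as a leading escape
encoded_c1_escape = b"\xc2\x9b[0m"  # the same control, properly encoded

raw_ok = is_valid_utf8(raw_c1_escape)
encoded_ok = is_valid_utf8(encoded_c1_escape)
```

Here `raw_ok` comes out `False` and `encoded_ok` comes out `True`, which is the practical reason a raw byte in the 0x80-0x9F range should not be dropped into a UTF-8 stream as a control.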
Not only "You should not create escape sequences containing bytes >= 0x80 after the leading escape" but also "You should not create escape sequences containing bytes >= 0x80 as the leading escape".
Note that C1 controls of Unicode and ISO-8859-* will be converted to a pair
==================================
Frank Yung-Fong Tang
System Architect, International Development, AOL Interactive Services
AIM:yungfongta mailto:[EMAIL PROTECTED] Tel:650-937-2913
Yahoo! Msg: frankyungfongtan
John 3:16 "For God so loved the world that he gave his one and only Son, that whoever believes in him shall not perish but have eternal life."
Does your software display Thai language text correctly for Thailand users? -> Basic Concept of Thai Language linked from Frank Tang's Internationalization Secrets
Want to translate your English text to something Thailand users can understand? -> Try English-to-Thai machine translation at http://c3po.links.nectec.or.th/parsit/

