[EMAIL PROTECTED] wrote:

> On 06/12/2001 01:13:48 PM Jianping Yang wrote:
>
> >If you convert < ED A0 80 ED B0 80 > into UTF-16, what does it mean then? I
> >think definitely it means U-00010000.
>
> I'd say not if that 6-byte sequence is interpreted in terms of *UTF-8*.

So UTF-8 is not compatible with UTF-16 even in its repository, which means a round-trip conversion is not guaranteed, and that may be a *big* issue.

>
> UTF-8 has no 6-byte sequences. It must be something else, like the thing
> informally designated in our discussions as UTF-8S.

The UTF-8S proposal will keep round-trip conversion between UTF-16 and UTF-8S. Please don't confuse UTF-8S with UTF-8, as they are different encoding forms based on the proposal.

Regards,
Jianping.

>
> - Peter
>
> ---------------------------------------------------------------------------
> Peter Constable
>
> Non-Roman Script Initiative, SIL International
> 7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
> Tel: +1 972 708 7485
> E-mail: <[EMAIL PROTECTED]>
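For readers following the byte arithmetic in this exchange, here is a minimal Python sketch of the two readings of < ED A0 80 ED B0 80 >. It is an illustration only and not part of the UTF-8S proposal text. The helper names (`decode_3byte`, `combine_surrogates`, `strict_utf8_accepts`) are hypothetical, and the "surrogate pair in two 3-byte sequences" reading is an assumption based on how UTF-8S is described in this thread.

```python
def decode_3byte(b: bytes) -> int:
    """Decode one 3-byte sequence of the form 1110xxxx 10xxxxxx 10xxxxxx.

    Unlike a strict UTF-8 decoder, this does NOT reject values in the
    surrogate range D800..DFFF (assumed UTF-8S-style behaviour).
    """
    if len(b) != 3 or (b[0] & 0xF0) != 0xE0:
        raise ValueError("not a 3-byte lead sequence")
    if (b[1] & 0xC0) != 0x80 or (b[2] & 0xC0) != 0x80:
        raise ValueError("bad continuation byte")
    return ((b[0] & 0x0F) << 12) | ((b[1] & 0x3F) << 6) | (b[2] & 0x3F)


def combine_surrogates(high: int, low: int) -> int:
    """Combine a UTF-16 high/low surrogate pair into one code point."""
    if not (0xD800 <= high <= 0xDBFF and 0xDC00 <= low <= 0xDFFF):
        raise ValueError("not a valid surrogate pair")
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


def strict_utf8_accepts(data: bytes) -> bool:
    """Return True if Python's strict UTF-8 decoder accepts the bytes."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


seq = bytes([0xED, 0xA0, 0x80, 0xED, 0xB0, 0x80])

# Reading 1: two 3-byte sequences, each carrying one UTF-16 surrogate.
high = decode_3byte(seq[:3])
low = decode_3byte(seq[3:])
code_point = combine_surrogates(high, low)

# Reading 2: plain UTF-8, where the same code point takes four bytes.
utf8_form = chr(code_point).encode("utf-8")

print(f"surrogates: {high:04X} {low:04X} -> U+{code_point:04X}")
print("UTF-8 form:", utf8_form.hex(" ").upper())
print("strict UTF-8 accepts the 6-byte form:", strict_utf8_accepts(seq))
```

Under these assumptions the six bytes map to the surrogates D800 DC00, which combine to the code point written as U-00010000 above. A strict UTF-8 decoder rejects the same six bytes and uses the four-byte form F0 90 80 80 for that code point.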