|
In a message dated 11/5/2003 3:55:46 AM Pacific Standard Time, [EMAIL PROTECTED] writes:
Agreed. But to be fair to MySQL, they do mention as a potential problem I don't think this is an unique issue for MySQL about how to store the Unicode data, right? Basically, they have the followin choice:
UCS2 - as they are today as you describe
UTF-16 - that is what I think they should do but that might create issue for the "index" or substring operation
UTF-8
UCS4 or UTF-32 - that is what they think they may need if they support surrogate.
Mozilla use UTF-16 internally. glib use UCS4 as I understand for w_char in their "vendor definitation". MS use UTF-16 for Win32 api and OLE api (not sure about the internal since they are not open source). Tcl use UCS2 (and their converter does not handle surrogate)
This is a generic issue. Why it so special with MySQL? because the SQL api? ==================================
Frank Yung-Fong Tang System Architect, I�t�rn�ti�n�l D�v�l�pme�t, AOL Int�r��t�v� S�rvi�es AIM:yungfongta mailto:[EMAIL PROTECTED] Tel:650-937-2913 Yahoo! Msg: frankyungfongtan John 3:16 "For God so loved the world that he gave his one and only Son, that whoever believes in him shall not perish but have eternal life. Does your software display Thai language text correctly for Thailand users? -> Basic Conceptof Thai Language linked from Frank Tang's I�t�rn�ti�n�liz�ti�n Secrets Want to translate your English text to something Thailand users can understand ? -> Try English-to-Thai machine translation at http://c3po.links.nectec.or.th/parsit/ |
- Re: UTF-16 inside UTF-8 Jungshik Shin
- Re: UTF-16 inside UTF-8 Peter Kirk
- Re: UTF-16 inside UTF-8 Philippe Verdy
- Ill-formed sequences (was: Re: UTF-16 inside UT... Doug Ewell
- RE: Ill-formed sequences (was: Re: UTF-16 i... Addison Phillips [wM]
- Re: Ill-formed sequences (was: Re: UTF-... Doug Ewell
- Re: UTF-16 inside UTF-8 YTang0648
- Re: UTF-16 inside UTF-8 YTang0648
- Re: UTF-16 inside UTF-8 YTang0648
- Re: UTF-16 inside UTF-8 Doug Ewell
- Re: UTF-16 inside UTF-8 YTang0648
- Re: UTF-16 inside UTF-8 Peter Kirk
- Re: UTF-16 inside UTF-8 YTang0648
- Re: UTF-16 inside UTF-8 YTang0648
- Re: UTF-16 inside UTF-8 Doug Ewell
- Re: UTF-16 inside UTF-8 Philippe Verdy
- Re: UTF-16 inside UTF-8 Philippe Verdy
- Re: UTF-16 inside UTF-8 YTang0648
- Re: UTF-16 inside UTF-8 Philippe Verdy
- Re: UTF-16 inside UTF-8 Doug Ewell
- Re: UTF-16 inside UTF-8 YTang0648

