On Fri, Feb 04, 2005 at 06:04:00AM -0500, Daniel Veillard wrote:
> On Fri, Feb 04, 2005 at 10:47:24AM +0000, David Harris wrote:
> > xmlDocDumpFormatMemory works fine in xmlbuff is coerced to char but not
> > wchar_t. How can I use wide characters? are there wide character versions
> > of
> > these functions? I certainly couldnt find them or any clue about using utf8
> > input for this library.
>
> You need to read more about the subject
> http://www.w3.org/TR/REC-xml/#sec-guessing
> if you use XML, you'd better read the associated spec. It does *not* have to
> show up at the API level. There is also specific doc about encoding in
> libxml2
> http://xmlsoft.org/xmlmem.html
> it' 2 clicks away from the main page http://xmlsoft.org/
Wrong link, the encoding explanations are actually
http://xmlsoft.org/encoding.html
which is directly listed as "Encodings support" on the main page.
Also "wide character" does not mean much is that UTF-16, UC2, UCS4, direct
Unicode code points ? An encoding means something it actually explicitely
tell that it means a specific code conversion to Unicode code points
and back (when possible). wchar_t does not say what is the integer content
nor even it's size which is left platform/compiler dependant, i.e. unusable
as a base for real technical discussion without more informations you did
not provide.
Character sets, encodings, Unicode, are hard concepts, but absolutely
needed if you really want to do internationalization properly. XML defined the
framework clearly, and you need to understand it (especially Appendix F) to
not make mistakes in an XML processing chain.
Daniel
--
Daniel Veillard | Red Hat Desktop team http://redhat.com/
[EMAIL PROTECTED] | libxml GNOME XML XSLT toolkit http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/
_______________________________________________
xml mailing list, project page http://xmlsoft.org/
[email protected]
http://mail.gnome.org/mailman/listinfo/xml