Thanks for that.
I'm getting an XMLException when parsing the UTF-8 chars, e.getMessage() is
40094298.
what should I set the charSizes pointer to in this function?:
XMLUTF8Transcoder::transcodeFrom(const XMLByte* const
srcData
, const unsigned int srcCount
, XMLCh* const toFill
, const unsigned int maxChars
, unsigned int& bytesEaten
, unsigned char* const charSizes)
At the moment, I'm doing a unsigned char* charSizes = new unsigned
char[len+1], but it does not work.
Regards,
Mihai Matei
----- Original Message ----
From: Alberto Massari <[EMAIL PROTECTED]>
To: [email protected]
Sent: Monday, May 21, 2007 8:32:28 AM
Subject: Re: UTF-8 characters in CDATA section
At 17.59 19/05/2007 -0700, Mihai Matei wrote:
>Can you point me to some sample code doing this? Do I have to
>recompile the library with this new transcoder?
No, the transcoder is already part of the library. Try something like this:
XMLUTF8Transcoder tx(0, 512);
size_t len = XMLString::stringLen(toTranscode);
const XMLCh* unicode = new XMLCh[len+1];
unsigned int charsEaten= 0;
tx.transcodeFrom(toTranscode, len+1, unicode, len+1, charsEaten,
XMLTranscoder::UnRep_Throw);
...
delete [] unicode;
Alberto
>----- Original Message ----
>From: Alberto Massari <[EMAIL PROTECTED]>
>To: [email protected]
>Sent: Friday, May 18, 2007 10:46:40 AM
>Subject: Re: UTF-8 characters in CDATA section
>
>The X() macro is a helper class that converts from the local encoding
>to Unicode; if you have UTF-8 data, you need to use instead the UTF-8
>transcoder.
>
>Alberto
>
>At 02.40 18/05/2007 -0700, Mihai Matei wrote:
> >Hi,
> >
> >I'm trying to add the attached file's contents to a CDATA section in
> >an xml. It contains a few Unicode-UTF8 characters from
> ><http://www.columbia.edu/kermit/utf8-t1.html>http://www.columbia.ed
> u/kermit/utf8-t1.html.
> >(you can view the file with Firefox, set the Character Encoding to
> >Unicode(UTF8)).
> >
> >//string 'text' has the contents;
> >//if I output it to a file with ofstream, the UTF8 characters are preserved
> >
> >DOMElement* pText = pDoc->createElement( X(tag.c_str()));
> >DOMCDATASection* pCdata = pDoc->createCDATASection(X(text.c_str()));
> >pText->appendChild(pCdata);
> >parent->appendChild(pText);
> >
> >the resulting xml however loses the UTF-8 characters. Is it the X()
> >macro that is to blame, or can I set other XML Document properties
> >so I keep my UTF8 chars?
> >
> >Thanks.
> >
> >
> >Got a little couch potato?
> >Check out fun
> ><http://us.rd.yahoo.com/evt=48248/*http://search.yahoo.com/search?f
> r=oni_on_mail&p=summer+activities+for+kids&cs=bz>summer
> >activities for kids.
> >
> >
> >
> >Ready for the edge of your seat?
> ><http://us.rd.yahoo.com/evt=48220/*http://tv.yahoo.com/>Check out
> >tonight's top picks on Yahoo! TV.
>
>
>
>
>
>
>
>
>
>____________________________________________________________________________________Be
>
>a better Heartthrob. Get better relationship answers from someone
>who knows. Yahoo! Answers - Check it out.
>http://answers.yahoo.com/dir/?link=list&sid=396545433
____________________________________________________________________________________Be
a better Heartthrob. Get better relationship answers from someone who knows.
Yahoo! Answers - Check it out.
http://answers.yahoo.com/dir/?link=list&sid=396545433