characters with accents

dara Fri, 24 Feb 2006 08:15:20 -0800

Hi all,

I am having a little fun with 'accented characters' in an applicationi'm working with at the minute.

Through various tracing and debug methods, i can see that the charactersare correctly being propogated around the system until the point wherethey are added to a DOM in Xerces.

After subsequent writing of the DOM to a MemBufFormatTarget* andretrieval through the getRawBuffer() interface, the data is -corrupt-.(i get characters out, but they are not accented and not the chars frommy buffer)

To remove any potential issues in the application, i've altered theDOMPrint sample to write to the same type of target. It correctly parsesand writes the small XML document I've given it with accented charactersand in the codepage:: iso-8859-1.

I then added some code to add another node with some accentedcharacters, and these do not appear in the output. (which isinteresting, my app is giving me corrupt data and the sampl isn't givingme anything).

I'm running v240 at the moment under a linux variant, and i've set theshell locale to the same as the xml document being parsed. However theapplication still reports the locale as "C", so I'm going to try a fewmore things.

But I've a feelling I'm missing something basic. Do I need to dosomething special with the Transcoder class, or the DOM document whenadding an element and text node ?


I'm sure somebody has met this already. Any pointers ?

Thanks and Regards

Dara

characters with accents

Reply via email to