Hello,

I'm having problems with character encoding in my messages, and I'm not that
good with character standards. I'm using a DOMParser in Xerces-C 1.4 with
ISO-8859-1 encoding.

I've coded a component that parses a message, modifies it, and returns the
modified message as a C++ STL string.
I'm using the functionality in the DOMPrint example to get the document in
text form.

My problem is: I'm getting XML messages in text form from another parser,
in another COBOL-ish system, and the messages are encoded differently in some
way. Swedish characters are for example shown as "&aring;" (don't know if you
can see this: "å", but it's an a with a ring above it =)). Xerces doesn't
seem to accept the "&aring;" value; it says that "aring" is an unknown
entity. Other special codes, such as "&amp;" and "&gt;", are parsed OK.

I've tried declaring entities in the DTD, like <!ENTITY aring "&#229;">, and
then it works better: getNodeValue on "&aring;" gets me the right
character. But running the DOMPrint code on such a document gives me
"&amp;ring;"...

I read a similar post about German character encoding in DOMString, which said
that setlocale() could be used. I've tried that, but it didn't make it better.

I'm sorry about my cheesy English and my poor knowledge of character
encodings :-/

Any help on this is REALLY appreciated!! Thanks!

//Stefan

