I searched the Xerces FAQ and tried to review the mailing list archives, but
they appeared to be offline, and I was not able to find much information on
my question. If this has been asked and answered before, I apologize for
reposting.
I am using Xerces version 1.4.2 to parse an XML document containing a Euro
sign character. I create a FileInputStream for the file and pass it to the
parser wrapped in an InputSource:
// create input stream from XML file
FileInputStream inputStream = new FileInputStream(new File(fileName));
// parse XML
parser.parse(new InputSource(inputStream));
When the XML document containing the Euro sign is parsed, the Euro sign is
misread and converted to a different character (in this case a question
mark). I tried changing the document encoding from UTF-8 to UTF-16, but this
generated an exception stating that UTF-16 was not supported.
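To make the expected behavior concrete, here is a minimal sketch of the round trip I would expect to work. It is an assumption-laden illustration, not my actual code: it uses the JAXP DocumentBuilder API rather than the Xerces 1.4.2 parser classes directly, it parses an in-memory document instead of a file, and the class and method names (EuroParseSketch, parseRootText) are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.InputSource;

class EuroParseSketch {
    // Hypothetical helper: encodes the document as UTF-8 bytes, parses it
    // from a byte stream, and returns the text of the root element.
    static String parseRootText(String xml) {
        try {
            byte[] utf8 = xml.getBytes(StandardCharsets.UTF_8);
            DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
            // The parser reads the encoding from the XML declaration.
            return builder.parse(new InputSource(new ByteArrayInputStream(utf8)))
                          .getDocumentElement()
                          .getTextContent();
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) {
        String xml =
            "<?xml version=\"1.0\" encoding=\"UTF-8\"?><price>\u20AC</price>";
        String text = parseRootText(xml);
        // Print the code point of the parsed character; prints 20ac
        System.out.println(Integer.toHexString(text.charAt(0)));
    }
}
```

If the bytes on disk really are UTF-8 and the declaration says so, I would expect the parsed character to be U+20AC rather than a question mark.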
In order to write the XML file (containing the Euro sign), I have to make
sure the data is written out as characters instead of bytes, because when the
Euro sign is converted to a byte, the high-order byte appears to be
discarded, so the wrong character is written out.
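The difference between the two ways of writing can be sketched as follows. This is only an illustration under my own assumptions: the class and method names (EuroEncodingSketch, truncateToByte, writeAsUtf8) are hypothetical, and it writes to an in-memory stream rather than to the real file. Casting the Euro sign (U+20AC) to a byte keeps only the low-order byte 0xAC, while writing it through a Writer with an explicit UTF-8 encoding produces the three bytes E2 82 AC.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.UncheckedIOException;
import java.io.Writer;
import java.nio.charset.StandardCharsets;

class EuroEncodingSketch {
    // Narrowing a char to a byte silently drops the high-order byte.
    static byte truncateToByte(char c) {
        return (byte) c;
    }

    // Writes the text as characters through a Writer that encodes to UTF-8.
    static byte[] writeAsUtf8(String text) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (Writer writer = new OutputStreamWriter(out, StandardCharsets.UTF_8)) {
            writer.write(text);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        // Low-order byte only; prints ac
        System.out.println(Integer.toHexString(truncateToByte('\u20AC') & 0xFF));

        // Full UTF-8 encoding; prints e2 82 ac
        StringBuilder hex = new StringBuilder();
        for (byte b : writeAsUtf8("\u20AC")) {
            if (hex.length() > 0) hex.append(' ');
            hex.append(Integer.toHexString(b & 0xFF));
        }
        System.out.println(hex);
    }
}
```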
Finally, my question is whether I can use Xerces to parse an XML document
containing the Euro sign and if so, how do I do it?
I appreciate any help offered.
Thanks.
- Mike