Hello Harald,

The answer to your question is no. This is fatal error. A document is not well-formed if it contains malformed byte sequences [1].

[1] http://www.w3.org/TR/2004/REC-xml-20040204/#charencoding

On Mon, 26 Jul 2004, Harald Wehr wrote:

I have to process utf-8 documents. Sometimes a document contains an illegal character that causes an UTFDataFormatException due to the invalid byte.

Is it possible to tell xerces just to ignore these bytes and to go on parsing the document?

There is no need to display these documents 100 % correctly. A missing character is acceptable for us in this project rather than chrashing the whole document with this exception.

Thanks for your help

Harald



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

--------------------------- Michael Glavassevich XML Parser Development IBM Toronto Lab E-mail: [EMAIL PROTECTED] E-mail: [EMAIL PROTECTED]

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to