Re: No Error Reported When Serializing Invalid Character References in XML 1.0

2012-01-03 Thread Michael Glavassevich
Alberto Massari wrote on 01/03/2012 12:42:39 PM: > Hi Michael, > in this case the error is not caused by a character that the target > encoding doesn't support (Xerces-C would handle that). It's because a > node contains a character that XML is not supposed to accept. That's something that would

Re: No Error Reported When Serializing Invalid Character References in XML 1.0

2012-01-03 Thread Alberto Massari
Hi Michael, in this case the error is not caused by a character that the target encoding doesn't support (Xerces-C would handle that). It's because a node contains a character that XML is not supposed to accept. Alberto Il 03/01/2012 17:47, Michael Glavassevich ha scritto: Does Xerces-C's im

Re: No Error Reported When Serializing Invalid Character References in XML 1.0

2012-01-03 Thread Michael Glavassevich
Does Xerces-C's implementation of LSSerializer [1] support the "well-formed" parameter? It's a required feature. Turning that on in Xerces-J would cause an error to be reported for the invalid character. Thanks. [1] http://www.w3.org/TR/DOM-Level-3-LS/load-save.html#LS-LSSerializer Michael Glav

Re: No Error Reported When Serializing Invalid Character References in XML 1.0

2012-01-03 Thread Alberto Massari
Hi Nedim, it's a known limitation of the current codebase: see https://issues.apache.org/jira/browse/XERCESC-1854 You can check if a character is valid according to XML 1.0 by using XMLChar1_0::isXMLChar. For XML 1.1, use XMLChar1_1::isXMLChar Alberto Il 03/01/2012 15:01, Nedim Srndic ha scri

No Error Reported When Serializing Invalid Character References in XML 1.0

2012-01-03 Thread Nedim Srndic
Hello, is an invalid character reference in XML 1.0. If I write the byte value "\x01" to a Xerces-C TextNode and serialize the entire DOMDocument using UTF-8 and StdOutFormatTarget with XML version set to "1.0", then Xerces-C writes the resulting XML document (without substituting the character