"Reuben Sivan" <[EMAIL PROTECTED]> wrote on 10/29/2004 11:05:52 AM:

>
> I recently got a parsing error reading an XML file which I had produced
> programmatically.
> The error is org.xml.sax.SAXParseException: Illegal XML character &#xb;.
>
> The reason for the error, I found out, was that 0xb is an illegal XML
> character.
>
> I easily worked my way around the error, by filtering my data before
> writing it to XML (which is consistent with my application).
>
> In any case, my question is, given that 0xb and other control characters
> are illegal in XML,


They're not legal characters in XML 1.0 but they are in XML 1.1.

> shouldn't the function
> Document.createTextNode(String) check its input and generate an
> exception?

This was a conscious decision by the DOM WG [1], probably made for performance reasons. DOM Level 3 provides a method for checking well-formedness. See Document.normalizeDocument() [2].

> Thanks,
>
>    Reuben
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>


[1] http://www.w3.org/DOM/faq.html#WFXML
[2] http://www.w3.org/TR/2004/REC-DOM-Level-3-Core-20040407/core.html#Document3-normalizeDocument

Michael Glavassevich
XML Parser Development
IBM Toronto Lab
E-mail: [EMAIL PROTECTED]

E-mail: [EMAIL PROTECTED]

Reply via email to