RE: Making Xerces less strict?

david_n_bertoni Thu, 23 Sep 2004 09:48:54 -0700

> I think the blame belongs with your clients' authoring tools, which
> should help them produce well-formed documents.  On the other hand, if
> you want to work around the presence of certain illegal characters, you
> could (as a service) translate them into character entities before
> handing them off to a parser.  (I wouldn't, though.  You'll come to
> regret it, perhaps when you have to accept a UTF-16 document, and a byte
> of data is no longer even roughly equivalent to a character.)


Character entities (did you mean numerica character references?) will not 
help in XML 1.0 if the character is forbidden.  In XML 1.1, you can use 
the "control" characters x1-x1F through numeric character references.

However, I absolutely agree with you that the parser should not enable 
parsing of documents that are not well-formed.  Fix the documents, don't 
break the parser!

Dave

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: Making Xerces less strict?

Reply via email to