> I think the blame belongs with your clients' authoring tools, which > should help them produce well-formed documents. On the other hand, if > you want to work around the presence of certain illegal characters, you > could (as a service) translate them into character entities before > handing them off to a parser. (I wouldn't, though. You'll come to > regret it, perhaps when you have to accept a UTF-16 document, and a byte > of data is no longer even roughly equivalent to a character.)
Character entities (did you mean numerica character references?) will not help in XML 1.0 if the character is forbidden. In XML 1.1, you can use the "control" characters x1-x1F through numeric character references. However, I absolutely agree with you that the parser should not enable parsing of documents that are not well-formed. Fix the documents, don't break the parser! Dave --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]