On 16 May 2005, at 23:52, Daniel Rall wrote:
I took a look through the spec, but nothing stood out. John, are there
any particular portions of the spec that I should be looking at in
particular? The section on valid characters is really clear that the
majority of control characters can't occur, but I didn't see any
discussion as to why replacing them with character references isn't a
good enough escaping mechanism. Not trying to be obstructionist -- just
trying to understand.
Daniel, I quite understand....
Section 2.2 defines the character ranges which can occur in a parsed entity. My understanding of a parsed entity is the parsing process replaces all the character references. So the "escaping" of characters has no effect.
I fired up Oxygen and did an experiment. When a document contains � I get the following error when I check for well formedness:
F Character reference "�" is an invalid XML character.
I believe that Xerces is used to perform this check.
John Wilson The Wilson Partnership http://www.wilson.co.uk