Hi everyone, I believe the motivation for always escaping '>', had to with to do with ']]>' which is the end delimiter for a CDATA section. That string cannot appear in CharData, so the '>' needs to be escaped during serialization of ']]>'. Since the serializer doesn't keep track of previous characters, currently the only way to solve the problem is to always escape '>', and indeed that was the change that was made.
Hope that helps. On Wed, 9 Jul 2003, Christopher Ebert wrote: > > I don't think this is a bug -- if you look at the XML spec, > doesn't > have to be escaped. http://www.w3.org/TR/1998/REC-xml-19980210#dt-chardata > may make it clearer or, as with many of these things, it may make it muddier: > some people interpret the spec to say > must be escaped and some that it may. > Somewhere in the versions, Xerces changed it's mind (it used to escape >, now > it doesn't). The resulting XML is still well-formed (and should parse -- are > you saying it doesn't?) > > Chris > > -----Original Message----- > From: Rajic Zeljko [mailto:[EMAIL PROTECTED] > Sent: Wednesday, July 09, 2003 12:04 AM > To: [EMAIL PROTECTED] > Subject: escaping does not work properly > > > Hi, > > I'm currently experiencing a strange behaviour when using Xerces to serialize > a XML document: for some reason the ">" character does not get escaped to > ">" when adding it as attribute or element value. > > To demonstrate what I mean I've attached a small Java Application. When > running it, I receive the following output: > > <?xml version="1.0" encoding="UTF-8"?> > <testcase name="testcasename" time="19:00:12"> > <error type="This is the testcase description: a tag <descriptiontag> > !!" > >This is the error text with the <errortag> !!</error> > </testcase> > > Instead I'd expect to receive the following: > > <?xml version="1.0" encoding="UTF-8"?> > <testcase name="testcasename" time="19:00:12"> > <error type="This is the testcase description: a tag > <descriptiontag%gt; !!" > >This is the error text with the <errortag> !!</error> > </testcase> > > > I've tried the application with a older version of Xerces, but also with the > current 2.4.0. Both with the same result. > > As I can't imagine to be the first to see this behaviour I guess that I must > doing something wrong. So it would be nice if someone could point me to my > mistake. > > And by the way: is it possible to escape more characters than only "<>&' with > Xerces or SAX in common? For example also escaping the [] characters? > > Regards, > Zeljko > -------------------- Michael Glavassevich [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
