Hi everyone,

I believe the motivation for always escaping '>' had to do with ']]>',
the end delimiter of a CDATA section. That string cannot appear in
CharData, so the '>' must be escaped whenever ']]>' is serialized. Since
the serializer doesn't keep track of previous characters, currently the
only way to solve the problem is to always escape '>', and indeed that was
the change that was made.
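
To make the trade-off concrete, here is a minimal sketch of the two
strategies. It is not the Xerces serializer code: the class and method
names (CharDataEscaper, escapeAlways, escapeMinimal) are hypothetical, and
it assumes the whole text is available as a single String rather than
arriving in chunks.

```java
// Hypothetical helper for illustration only; not part of Xerces.
final class CharDataEscaper {

    // No lookbehind: every '>' is escaped, so "]]>" can never be emitted.
    static String escapeAlways(String text) {
        StringBuilder out = new StringBuilder(text.length() + 16);
        for (int i = 0; i < text.length(); i++) {
            appendEscaped(out, text.charAt(i), true);
        }
        return out.toString();
    }

    // With lookbehind: escape '>' only when it directly follows "]]".
    static String escapeMinimal(String text) {
        StringBuilder out = new StringBuilder(text.length() + 16);
        int brackets = 0; // consecutive ']' seen immediately before c
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            appendEscaped(out, c, c == '>' && brackets >= 2);
            brackets = (c == ']') ? brackets + 1 : 0;
        }
        return out.toString();
    }

    // '<' and '&' are always escaped; '>' only when escapeGt is true.
    private static void appendEscaped(StringBuilder out, char c, boolean escapeGt) {
        switch (c) {
            case '<': out.append("&lt;"); break;
            case '&': out.append("&amp;"); break;
            case '>':
                if (escapeGt) out.append("&gt;"); else out.append('>');
                break;
            default: out.append(c);
        }
    }

    public static void main(String[] args) {
        String text = "a > b and x[y[0]]> z";
        System.out.println(escapeAlways(text));
        System.out.println(escapeMinimal(text));
    }
}
```

A real serializer that receives character data in several calls would have
to carry the bracket count across those calls, which is the state the
current implementation does not keep.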

Hope that helps.

On Wed, 9 Jul 2003, Christopher Ebert wrote:

>
>       I don't think this is a bug -- if you look at the XML spec, > doesn't 
> have to be escaped. http://www.w3.org/TR/1998/REC-xml-19980210#dt-chardata 
> may make it clearer or, as with many of these things, it may make it muddier: 
> some people interpret the spec to say > must be escaped and some that it may. 
> Somewhere in the versions, Xerces changed its mind (it used to escape >, now 
> it doesn't). The resulting XML is still well-formed (and should parse -- are 
> you saying it doesn't?)
>
>       Chris
>
> -----Original Message-----
> From: Rajic Zeljko [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, July 09, 2003 12:04 AM
> To: [EMAIL PROTECTED]
> Subject: escaping does not work properly
>
>
> Hi,
>
> I'm currently experiencing a strange behaviour when using Xerces to serialize 
> an XML document: for some reason the ">" character does not get escaped to 
> "&gt;" when adding it as attribute or element value.
>
> To demonstrate what I mean I've attached a small Java Application. When 
> running it, I receive the following output:
>
> <?xml version="1.0" encoding="UTF-8"?>
> <testcase name="testcasename" time="19:00:12">
>     <error type="This is the testcase description: a tag &lt;descriptiontag> 
> !!"
> >This is the error text with the &lt;errortag> !!</error>
> </testcase>
>
> Instead I'd expect to receive the following:
>
> <?xml version="1.0" encoding="UTF-8"?>
> <testcase name="testcasename" time="19:00:12">
>     <error type="This is the testcase description: a tag 
> &lt;descriptiontag&gt; !!"
> >This is the error text with the &lt;errortag&gt; !!</error>
> </testcase>
>
>
> I've tried the application with an older version of Xerces, but also with the 
> current 2.4.0. Both with the same result.
>
> As I can't imagine being the first to see this behaviour, I guess that I must 
> be doing something wrong. So it would be nice if someone could point me to my 
> mistake.
>
> And by the way: is it possible to escape more characters than only "<>&" with 
> Xerces or SAX in general? For example also escaping the [] characters?
>
> Regards,
> Zeljko
>

--------------------
Michael Glavassevich
[EMAIL PROTECTED]
