I don't think this is a bug -- if you look at the XML spec, > doesn't 
have to be escaped. http://www.w3.org/TR/1998/REC-xml-19980210#dt-chardata may 
make it clearer or, as with many of these things, it may make it muddier: some 
people interpret the spec to say > must be escaped and some that it may. 
Somewhere in the versions, Xerces changed it's mind (it used to escape >, now 
it doesn't). The resulting XML is still well-formed (and should parse -- are 
you saying it doesn't?)

        Chris

-----Original Message-----
From: Rajic Zeljko [mailto:[EMAIL PROTECTED]
Sent: Wednesday, July 09, 2003 12:04 AM
To: [EMAIL PROTECTED]
Subject: escaping does not work properly 


Hi,

I'm currently experiencing a strange behaviour when using Xerces to serialize a 
XML document: for some reason the ">" character does not get escaped to ">" 
when adding it as attribute or element value.

To demonstrate what I mean I've attached a small Java Application. When running 
it, I receive the following output:

<?xml version="1.0" encoding="UTF-8"?>
<testcase name="testcasename" time="19:00:12">
    <error type="This is the testcase description: a tag &lt;descriptiontag> !!"
>This is the error text with the &lt;errortag> !!</error>
</testcase>

Instead I'd expect to receive the following:

<?xml version="1.0" encoding="UTF-8"?>
<testcase name="testcasename" time="19:00:12">
    <error type="This is the testcase description: a tag &lt;descriptiontag%gt; 
!!"
>This is the error text with the &lt;errortag&gt; !!</error>
</testcase>


I've tried the application with a older version of Xerces, but also with the 
current 2.4.0. Both with the same result.

As I can't imagine to be the first to see this behaviour I guess that I must 
doing something wrong. So it would be nice if someone could point me to my 
mistake.

And by the way: is it possible to escape more characters than only "<>&' with 
Xerces or SAX in common? For example also escaping the [] characters?

Regards,
Zeljko

Reply via email to