I don't think this is a bug -- if you look at the XML spec, > doesn't
have to be escaped. http://www.w3.org/TR/1998/REC-xml-19980210#dt-chardata may
make it clearer or, as with many of these things, it may make it muddier: some
people interpret the spec to say > must be escaped and some that it may.
Somewhere in the versions, Xerces changed it's mind (it used to escape >, now
it doesn't). The resulting XML is still well-formed (and should parse -- are
you saying it doesn't?)
Chris
-----Original Message-----
From: Rajic Zeljko [mailto:[EMAIL PROTECTED]
Sent: Wednesday, July 09, 2003 12:04 AM
To: [EMAIL PROTECTED]
Subject: escaping does not work properly
Hi,
I'm currently experiencing a strange behaviour when using Xerces to serialize a
XML document: for some reason the ">" character does not get escaped to ">"
when adding it as attribute or element value.
To demonstrate what I mean I've attached a small Java Application. When running
it, I receive the following output:
<?xml version="1.0" encoding="UTF-8"?>
<testcase name="testcasename" time="19:00:12">
<error type="This is the testcase description: a tag <descriptiontag> !!"
>This is the error text with the <errortag> !!</error>
</testcase>
Instead I'd expect to receive the following:
<?xml version="1.0" encoding="UTF-8"?>
<testcase name="testcasename" time="19:00:12">
<error type="This is the testcase description: a tag <descriptiontag%gt;
!!"
>This is the error text with the <errortag> !!</error>
</testcase>
I've tried the application with a older version of Xerces, but also with the
current 2.4.0. Both with the same result.
As I can't imagine to be the first to see this behaviour I guess that I must
doing something wrong. So it would be nice if someone could point me to my
mistake.
And by the way: is it possible to escape more characters than only "<>&' with
Xerces or SAX in common? For example also escaping the [] characters?
Regards,
Zeljko