[ http://issues.apache.org/jira/browse/XERCESJ-1094?page=all ]
Michael Glavassevich resolved XERCESJ-1094:
-------------------------------------------
Resolution: Fixed
Thanks for reporting. This actually had nothing to do with the encoding of the
document, but rather how Xerces deals with unescaped control characters in
attribute values. Instead of reporting an error, it kept looping back into the
scanner. Should be fixed now in CVS.
> Xerces in infinite loop validating wrongly encoded XML 1.1 documents
> --------------------------------------------------------------------
>
> Key: XERCESJ-1094
> URL: http://issues.apache.org/jira/browse/XERCESJ-1094
> Project: Xerces2-J
> Type: Bug
> Versions: 2.7.1
> Environment: Linux, Solaris, jdk 1.2.2/1.4/1.5
> Reporter: Yves Lafon
> Assignee: Michael Glavassevich
> Priority: Blocker
>
> When parsing a XML1.1 document from an InputSource, where the encoding is set
> to iso-8859-1, with an encoding set to UTF-8 in the XML declaration, and with
> a iso-8859-2 character in an attribute, then xerces enters an infinite loop.
> If the same character is not in the attribute, then Xerces reports an invalid
> XML character instead of blocking.
> If the encoding of the input source is not set to iso-8859-1, Xerces works
> fine also.
> Sample doc and modified DocumentScanner that demonstrate the issue at
> http://jigsaw.w3.org/Yves/xercesBug.zip
> Thanks,
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]