[ http://issues.apache.org/jira/browse/XERCESJ-1156?page=all ]
Jongjin Choi updated XERCESJ-1156:
----------------------------------
Attachment: XERCESJ-1156.jar
test case xml files.
- sample-bad.xml
- sample-good.xml
- sample.dtd
- 샘플.dtd (korean name file)
the UCS codes for this file : U+C0D8, U+D50C
> MalformedURLException occurs when SYSTEMID value is korean character relative
> url
> ---------------------------------------------------------------------------------
>
> Key: XERCESJ-1156
> URL: http://issues.apache.org/jira/browse/XERCESJ-1156
> Project: Xerces2-J
> Type: Bug
> Versions: 2.6.2, 2.8.0
> Environment: Windows XP (korean edition), JDK 1.4, JDK 5.0, xerces 2.6.0,
> xerces 2.8.2
> Reporter: Jongjin Choi
> Attachments: XERCESJ-1156.jar
>
> If a xml file has a DOCTYPE and the SYSTEMID of DOCTYPE has a korean
> character relative url,
> MalformedException occurs during parsing.
> I guess that the same problem will occur if japanese or chinese character is
> used in SYSTEMID.
> This does not occur in JDK 1.4.2 with its internal crimson parser.
> I found that Xerces 2.6.2, Xerces 2.8.0 and JDK 1.5 have this problem.
> This is related to XERCESJ-391 which seems to be fixed over Xerces 2.6.2 in
> my test.
> The test case xml/dtd and patch for Xerces 2.6.2 will be uploaded.
> The patch is simple. The SYSTEMID value which has non-us-ascii character
> needs to be escaped.
> The escaping logic is from the XMLEntityManager#getUserDir().
> I think that xerces 2.8.0 can be patched in the same way.
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]