[ 
http://issues.apache.org/jira/browse/XERCESJ-1156?page=comments#action_12373582 
] 

Jongjin Choi commented on XERCESJ-1156:
---------------------------------------

To reproduce : Use sax.DocumentTracer sample.

(1)  sucess case
java sax.DocumentTracer test-good.xml

(2) fail case
java sax.DocumentTracer test-bad.xml

The result is :

setDocumentLocator([EMAIL PROTECTED])
startDocument()
 startDTD(name="샘플",publicId=null,systemId="샘플.DTD")
error: Parse error occurred - no protocol: 샘플.DTD
java.net.MalformedURLException: no protocol: 샘플.DTD
        at java.net.URL.<init>(URL.java:537)
        at java.net.URL.<init>(URL.java:434)
        at java.net.URL.<init>(URL.java:383)
        at org.apache.xerces.impl.XMLEntityManager.setupCurrentEntity(Unknown 
Source)
        at org.apache.xerces.impl.XMLEntityManager.startEntity(Unknown Source)
        at org.apache.xerces.impl.XMLEntityManager.startDTDEntity(Unknown 
Source)
        at org.apache.xerces.impl.XMLDTDScannerImpl.setInputSource(Unknown 
Source)
        at 
org.apache.xerces.impl.XMLDocumentScannerImpl$DTDDispatcher.dispatch(Unknown 
Source)
        at 
org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown 
Source)
        at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
        at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
        at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
        at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
        at sax.DocumentTracer.main(Unknown Source)

This only difference between sample-good.xml and sample-bad.xml is the value of 
SYSTEMID. (english and korean)

> MalformedURLException occurs when SYSTEMID value is korean character relative 
> url
> ---------------------------------------------------------------------------------
>
>          Key: XERCESJ-1156
>          URL: http://issues.apache.org/jira/browse/XERCESJ-1156
>      Project: Xerces2-J
>         Type: Bug

>     Versions: 2.6.2, 2.8.0
>  Environment: Windows XP (korean edition), JDK 1.4, JDK 5.0, xerces 2.6.0, 
> xerces 2.8.2
>     Reporter: Jongjin Choi
>  Attachments: XERCESJ-1156.jar
>
> If a xml file has a DOCTYPE and the SYSTEMID of DOCTYPE has a korean 
> character relative url,
> MalformedException occurs during parsing.
> I guess that the same problem will occur if japanese or chinese character is 
> used in SYSTEMID.
> This does not occur in JDK 1.4.2 with its internal crimson parser.
> I found that Xerces 2.6.2, Xerces 2.8.0 and JDK 1.5 have this problem.
> This is related to XERCESJ-391 which seems to be fixed over Xerces 2.6.2 in 
> my test.
> The test case xml/dtd and patch for Xerces 2.6.2 will be uploaded.
> The patch is simple. The SYSTEMID value which has non-us-ascii character 
> needs to be escaped.
> The escaping logic is from the XMLEntityManager#getUserDir().
> I think that xerces 2.8.0 can be patched in the same way.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to