Treatment of URLs with blanks is incorrect in some places
---------------------------------------------------------

                 Key: UIMA-2097
                 URL: https://issues.apache.org/jira/browse/UIMA-2097
             Project: UIMA
          Issue Type: Bug
          Components: Core Java Framework
    Affects Versions: 2.3.1SDK
            Reporter: Marshall Schor
            Assignee: Marshall Schor
            Priority: Minor
             Fix For: 2.3.2SDK


A user reported that the Document Analyzer fails to write correct file names 
in the output directory when the input directory path contains a blank (on 
Windows).  This was traced to faulty URL handling.

Proper URL handling seems to require observing these principles:

1) A URL may contain blanks and other characters that are invalid in a URI
2) A URL may already contain %20-style escapes for blanks and other characters 
that need to be escaped

Creating files from these: use the File constructor that takes a URI, i.e., 
new File(aUri).
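
A minimal sketch of why the URI form matters when creating a File. The path 
and the helper name fileFor are hypothetical, chosen only for illustration; 
the sketch assumes the URI string is already properly escaped.

```java
import java.io.File;
import java.net.URI;

public class FileFromUriSketch {
    // Hypothetical helper: build a File from an already-escaped file: URI string.
    static File fileFor(String escapedUri) {
        // File(URI) decodes escapes such as %20 back into the real characters.
        return new File(URI.create(escapedUri));
    }

    public static void main(String[] args) {
        String s = "file:/tmp/my%20docs/a.txt"; // hypothetical path with a blank

        File viaUri = fileFor(s);
        // For contrast: building the File from the raw (still escaped) path string.
        File viaRawPath = new File(URI.create(s).getRawPath());

        System.out.println(viaUri.getParentFile().getName());     // my docs
        System.out.println(viaRawPath.getParentFile().getName()); // my%20docs
    }
}
```

The second File points at a directory literally named "my%20docs", which is 
the kind of wrong output name described in this report.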

Creating URIs from URLs: if the URL has unescaped blanks, etc., both 
new URI(aUrl.toString()) and aUrl.toURI() fail, i.e., they throw a 
URISyntaxException because of characters that are illegal in a URI.
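
The failure can be reproduced with a small sketch. The URL string and the 
helper names toUriFails and newUriFails are assumptions made for illustration 
only.

```java
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URISyntaxException;
import java.net.URL;

public class UnescapedUrlSketch {
    // Hypothetical helper: true if aUrl.toURI() rejects the URL.
    static boolean toUriFails(String spec) {
        try {
            new URL(spec).toURI();
            return false;
        } catch (URISyntaxException e) {
            return true;
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    // Hypothetical helper: true if the single-argument URI constructor rejects the URL.
    static boolean newUriFails(String spec) {
        try {
            new URI(new URL(spec).toString());
            return false;
        } catch (URISyntaxException e) {
            return true;
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        String withBlank = "file:/tmp/my docs/a.txt";  // URL accepts the blank
        String escaped = "file:/tmp/my%20docs/a.txt";  // already escaped

        System.out.println(toUriFails(withBlank));  // true
        System.out.println(newUriFails(withBlank)); // true
        System.out.println(toUriFails(escaped));    // false
    }
}
```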

To make URI creation insert the escapes when they are not already there, use 
one of the multi-argument forms of new URI (see the Javadocs).
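
A minimal sketch of the multi-argument approach, using the five-argument 
constructor URI(scheme, authority, path, query, fragment). The helper name 
toQuotedUri and the sample URLs are hypothetical, and this is not necessarily 
the form the actual fix uses.

```java
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URISyntaxException;
import java.net.URL;

public class QuotingUriSketch {
    // Hypothetical helper: rebuild the URL's components through the
    // five-argument URI constructor, which quotes illegal characters.
    static URI toQuotedUri(String spec) {
        try {
            URL url = new URL(spec);
            return new URI(url.getProtocol(), url.getAuthority(),
                           url.getPath(), url.getQuery(), url.getRef());
        } catch (MalformedURLException | URISyntaxException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        URI fromBlank = toQuotedUri("file:/tmp/my docs/a.txt");
        System.out.println(fromBlank.getRawPath()); // /tmp/my%20docs/a.txt
        System.out.println(fromBlank.getPath());    // /tmp/my docs/a.txt

        // Caveat: these constructors always quote '%', so input that is
        // already escaped gets escaped a second time.
        URI fromEscaped = toQuotedUri("file:/tmp/my%20docs/a.txt");
        System.out.println(fromEscaped.getRawPath()); // /tmp/my%2520docs/a.txt
    }
}
```

Because the multi-argument constructors always quote the percent character, 
a URL that already carries %20-style escapes (principle 2) probably needs to 
be detected and passed to the single-argument form instead.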

Fix the patches from UIMA-1879 and UIMA-1748 to follow these principles.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
