Treatment of URLs with blanks is incorrect in some places
---------------------------------------------------------
Key: UIMA-2097
URL: https://issues.apache.org/jira/browse/UIMA-2097
Project: UIMA
Issue Type: Bug
Components: Core Java Framework
Affects Versions: 2.3.1SDK
Reporter: Marshall Schor
Assignee: Marshall Schor
Priority: Minor
Fix For: 2.3.2SDK
A user reported that the Document Analyzer fails to write correct file names
in the output directory when the input directory name contains a blank (on
Windows). This was traced to faulty URL handling.
Proper URL handling seems to require observing these principles:
1) A URL may contain blanks and other characters that are invalid in a URI
2) A URL may contain %20-style encoding of blanks and other characters that
need to be escaped
Creating files from these: use the File(URI) constructor, i.e. new File(aUri),
where aUri is the URI form of the URL.
Creating URIs from URLs: if the URL has unescaped blanks, etc., both
new URI(aUrl.toString()) and aUrl.toURI() fail (i.e., they throw a
URISyntaxException because of characters that are illegal in a URI).
To make the URI creation insert the escape characters when they are not
already there, you need to use one of the multi-argument forms of new URI
(see the Javadocs).
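
The points above can be illustrated with a minimal sketch. It is not the
actual UIMA patch; the class and method names (UrlToFileSketch, toUri, toFile)
and the /tmp paths are hypothetical. It assumes that trying the strict
conversion first and falling back to the multi-argument constructor is an
acceptable strategy. The fallback is needed only for unescaped input, because
the multi-argument constructors always quote '%' and would turn an existing
%20 into %2520.

```java
import java.io.File;
import java.net.URI;
import java.net.URISyntaxException;
import java.net.URL;

// Illustrative sketch only; names are hypothetical and not taken from UIMA.
class UrlToFileSketch {

    // Converts a URL to a URI, adding escapes only when they are missing.
    static URI toUri(URL url) throws URISyntaxException {
        try {
            // Succeeds when the URL is already a legal URI (e.g. blanks written as %20).
            return url.toURI();
        } catch (URISyntaxException e) {
            // The multi-argument constructor quotes illegal characters such as blanks.
            // It also quotes '%', so it is used only after the strict parse failed.
            // Limitation: a URL mixing raw blanks and %xx escapes gets double-escaped.
            return new URI(url.getProtocol(), url.getAuthority(), url.getPath(),
                           url.getQuery(), url.getRef());
        }
    }

    // Creates the File from the URI form, as suggested in the description.
    // new File(URI) throws IllegalArgumentException for non-"file" URIs.
    static File toFile(URL url) throws URISyntaxException {
        return new File(toUri(url));
    }

    public static void main(String[] args) throws Exception {
        // new URL(String) is deprecated in recent JDKs but still available.
        URL withBlank = new URL("file:/tmp/my dir/a.txt");
        URL escaped = new URL("file:/tmp/my%20dir/a.txt");

        System.out.println(toUri(withBlank).getRawPath()); // escaped form of the path
        System.out.println(toUri(escaped).getRawPath());   // unchanged, not double-escaped
        System.out.println(toFile(withBlank).getParentFile().getName());
    }
}
```

Both input forms end up with the same raw path, and the File built from
either one sees the directory name with a real blank in it.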
Fix the patches in UIMA-1879 and UIMA-1748.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira