[
https://issues.apache.org/jira/browse/UIMA-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marshall Schor reopened UIMA-2097:
----------------------------------
> Treatment of URLs with blanks is incorrect in some places
> ---------------------------------------------------------
>
> Key: UIMA-2097
> URL: https://issues.apache.org/jira/browse/UIMA-2097
> Project: UIMA
> Issue Type: Bug
> Components: Core Java Framework
> Affects Versions: 2.3.1SDK
> Reporter: Marshall Schor
> Assignee: Marshall Schor
> Priority: Minor
> Fix For: 2.3.2SDK
>
>
> User reported Document Analyzer failing to write correct names in the output
> directory, when the input directory had a blank in it (on Windows). Traced
> this to failing URL handling.
> Proper URL handling seems to need to observe these principles:
> 1) URL may have "blanks" and other "invalid-in-URI chars"
> 2) URL may have %20 style encoding of blanks and other need-to-be-escaped
> characters
> Creating files from these: need to use File(a-uri-form).
> Creating URIs from URLs - if the URL has unescaped blanks, etc., the form
> new URI(aUrl) fails; the aUrl.toURI() fails (i.e., throws an exception due to
> illegal chars for URI).
> To make the URI creation put in the escape chars if they're not there, you
> need to use the multi-arg form of new URI (see Javadocs).
> Fix the patches in UIMA-1879 and UIMA-1748.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira