[
https://issues.apache.org/jira/browse/NUTCH-1880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1880:
-----------------------------------
Attachment: NUTCH-1880-2x-v1.patch
NUTCH-1880-trunk-v1.patch
> URLUtil should not add additional slashes for file URLs
> -------------------------------------------------------
>
> Key: NUTCH-1880
> URL: https://issues.apache.org/jira/browse/NUTCH-1880
> Project: Nutch
> Issue Type: Sub-task
> Components: protocol
> Affects Versions: 1.9, 2.2.1
> Reporter: Sebastian Nagel
> Fix For: 2.3, 1.10
>
> Attachments: NUTCH-1880-2x-v1.patch, NUTCH-1880-trunk-v1.patch
>
>
> UrlUtil.toASCII(String url) and .toUNICODE(String url) add two slashes to
> file URLs if it contains a single slash: {{file:/path/index.html}} becomes
> {{file:///path/index.html}}. Both methods should keep the single slash to get
> a behavior consistent with URL.toString(). See NUTCH-1483 for details.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)