[ https://issues.apache.org/jira/browse/NUTCH-1880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14196907#comment-14196907 ]
Hudson commented on NUTCH-1880:
-------------------------------

SUCCESS: Integrated in Nutch-nutchgora #1219 (See [https://builds.apache.org/job/Nutch-nutchgora/1219/])
NUTCH-1483 (including NUTCH-1879, NUTCH-1880, NUTCH-1885) fix errors related to protocol-file (snagel: http://svn.apache.org/viewvc/nutch/branches/2.x/?view=rev&rev=1636736)
* /nutch/branches/2.x/CHANGES.txt
* /nutch/branches/2.x/conf/nutch-default.xml
* /nutch/branches/2.x/conf/regex-normalize.xml.template
* /nutch/branches/2.x/src/java/org/apache/nutch/util/URLUtil.java
* /nutch/branches/2.x/src/plugin/protocol-file/src/java/org/apache/nutch/protocol/file/File.java
* /nutch/branches/2.x/src/plugin/urlnormalizer-regex/sample/regex-normalize-default.test
* /nutch/branches/2.x/src/plugin/urlnormalizer-regex/sample/regex-normalize-default.xml
* /nutch/branches/2.x/src/test/org/apache/nutch/util/TestURLUtil.java
* /nutch/trunk/CHANGES.txt
* /nutch/trunk/conf/nutch-default.xml
* /nutch/trunk/conf/regex-normalize.xml.template
* /nutch/trunk/src/java/org/apache/nutch/util/URLUtil.java
* /nutch/trunk/src/plugin/protocol-file/src/java/org/apache/nutch/protocol/file/File.java
* /nutch/trunk/src/plugin/urlnormalizer-regex/sample/regex-normalize-default.test
* /nutch/trunk/src/plugin/urlnormalizer-regex/sample/regex-normalize-default.xml
* /nutch/trunk/src/test/org/apache/nutch/util/TestURLUtil.java

> URLUtil should not add additional slashes for file URLs
> -------------------------------------------------------
>
>                 Key: NUTCH-1880
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1880
>             Project: Nutch
>          Issue Type: Sub-task
>          Components: protocol
>    Affects Versions: 1.9, 2.2.1
>            Reporter: Sebastian Nagel
>             Fix For: 2.3, 1.10
>
>         Attachments: NUTCH-1880-2x-v1.patch, NUTCH-1880-trunk-v1.patch
>
>
> URLUtil.toASCII(String url) and URLUtil.toUNICODE(String url) add two slashes
> to file URLs that contain only a single slash after the scheme:
> {{file:/path/index.html}} becomes {{file:///path/index.html}}.
> Both methods should keep the single slash so that their behavior is
> consistent with URL.toString(). See NUTCH-1483 for details.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
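
For illustration only, here is a minimal sketch of the URL.toString() behavior the issue refers to. It uses only java.net classes and does not touch Nutch's URLUtil. The class name FileUrlSlashDemo and the helper viaUrlToString are hypothetical names introduced for this example, not part of the patch.

```java
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URL;

// Hypothetical demo class, not part of Nutch.
public class FileUrlSlashDemo {

    // Parses the string as a java.net.URL and returns URL.toString().
    public static String viaUrlToString(String spec) {
        try {
            URL url = URI.create(spec).toURL();
            return url.toString();
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException("not a valid URL: " + spec, e);
        }
    }

    public static void main(String[] args) {
        // Single-slash form from the issue description.
        System.out.println(viaUrlToString("file:/path/index.html"));
        // Triple-slash form (empty authority).
        System.out.println(viaUrlToString("file:///path/index.html"));
    }
}
```

On a standard JDK both lines should print file:/path/index.html, because URL.toString() omits the "//" when the authority is empty. That is the single-slash form the issue asks toASCII and toUNICODE to preserve.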