[ https://issues.apache.org/jira/browse/NUTCH-2705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel updated NUTCH-2705:
-----------------------------------
    Fix Version/s: 1.22
                       (was: 1.21)

> urlfilter-validator rejects IPv6 URLs
> -------------------------------------
>
>                 Key: NUTCH-2705
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2705
>             Project: Nutch
>          Issue Type: Bug
>          Components: plugin
>    Affects Versions: 1.15
>            Reporter: Sebastian Nagel
>            Priority: Minor
>             Fix For: 1.22
>
>
> The plugin urlfilter-validator rejects URLs with an IPv6 address as
> hostname/authority (given according to [RFC
> 2732|https://tools.ietf.org/html/rfc2732]):
> {noformat}
> % echo "http://[2010:836B:4179::836B:4179]/" \
>    | bin/nutch filterchecker -filterName urlfilter-validator -stdin
> Checking combination of these URLFilters: UrlValidator
> -http://[2010:836B:4179::836B:4179]/
> {noformat}
> We should also consider using the class
> [UrlValidator|https://commons.apache.org/proper/commons-validator/apidocs/org/apache/commons/validator/routines/UrlValidator.html]
> from commons-validator directly instead of a modified copy. This would make
> it easier to pick up updates and improvements with little effort: IPv6 is
> already supported, see the [class
> implementation|https://commons.apache.org/proper/commons-validator/apidocs/src-html/org/apache/commons/validator/routines/UrlValidator.html#line.380].
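
For illustration only, the kind of URL the issue is about can be recognized with the JDK alone. The sketch below is not the urlfilter-validator code and not the commons-validator UrlValidator; it swaps in java.net.URI to show that a bracketed IPv6 literal is a well-formed authority. The class name Ipv6HostCheck and the method hasIpv6LiteralHost are hypothetical names chosen for this example.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical helper, not part of Nutch or commons-validator.
final class Ipv6HostCheck {

    // Returns true if the URL parses and its host is a bracketed IPv6 literal.
    // java.net.URI.getHost() keeps the square brackets for IPv6 hosts.
    static boolean hasIpv6LiteralHost(String url) {
        try {
            String host = new URI(url).getHost();
            return host != null && host.startsWith("[") && host.endsWith("]");
        } catch (URISyntaxException e) {
            // Not parseable as a URI at all, so not an IPv6 URL either.
            return false;
        }
    }

    public static void main(String[] args) {
        // The URL from the filterchecker run quoted in the issue.
        System.out.println(hasIpv6LiteralHost("http://[2010:836B:4179::836B:4179]/"));
        // An ordinary hostname for comparison.
        System.out.println(hasIpv6LiteralHost("http://example.org/"));
    }
}
```

A check like this only detects the bracketed form. Whether a URL passes the filter would still depend on the scheme, path and query rules that the validator applies.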