[ https://issues.apache.org/jira/browse/NUTCH-3099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18019623#comment-18019623 ]
Isabelle Giguere commented on NUTCH-3099:
-----------------------------------------

[~lewismc] : I will not have time to work on a patch for the foreseeable future. Moreover, I'm not really familiar with the Nutch code base, so I would need more time than most contributors, even for a simple fix. ;) If you want to jump on it, please do!

> Allow wildcard '*' in http.proxy.exception.list
> -----------------------------------------------
>
>                 Key: NUTCH-3099
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3099
>             Project: Nutch
>          Issue Type: New Feature
>          Components: protocol
>    Affects Versions: 1.20
>            Reporter: Isabelle Giguere
>            Priority: Major
>             Fix For: 1.22
>
>
> The Nutch setting "http.proxy.exception.list" should accept the '*' wildcard.
> The equivalent JVM property "http.nonProxyHosts" does allow '*' at the start
> or end of a host name.
> https://docs.oracle.com/javase/8/docs/technotes/guides/net/proxies.html
> Note that starting Nutch with -Dhttp.nonProxyHosts="some.host" has no effect:
> crawling still goes through the proxy. Only "http.proxy.exception.list"
> can be used with Nutch.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)