[ 
https://issues.apache.org/jira/browse/NUTCH-3099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18029347#comment-18029347
 ] 

Hudson commented on NUTCH-3099:
-------------------------------

UNSTABLE: Integrated in Jenkins build Nutch ยป Nutch-trunk #203 (See 
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/203/])
NUTCH-3099 Allow wildcard '*' in http.proxy.exception.list (via Isabelle 
Giguere) (#865) (github: 
[https://github.com/apache/nutch/commit/cefb48a7502d3d57046674bd958fc5a78cc1378f])
* (edit) .github/workflows/junit-report.yml
* (add) 
src/plugin/lib-http/src/test/org/apache/nutch/protocol/http/api/TestHttpBase.java
* (edit) conf/nutch-default.xml
* (edit) 
src/plugin/lib-http/src/java/org/apache/nutch/protocol/http/api/HttpBase.java


> Allow wildcard '*' in http.proxy.exception.list
> -----------------------------------------------
>
>                 Key: NUTCH-3099
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3099
>             Project: Nutch
>          Issue Type: New Feature
>          Components: protocol
>    Affects Versions: 1.20
>            Reporter: Isabelle Giguere
>            Assignee: Isabelle Giguere
>            Priority: Major
>             Fix For: 1.22
>
>         Attachments: NUTCH-3099.2025-10-08.patch.txt, 
> NUTCH-3099.2025-10-09.patch.txt
>
>
> The Nutch setting "http.proxy.exception.list" should accept the '*' wildcards.
> The equivalent JVM property "http.nonProxyHosts" does allow '*' at the start 
> or end of a host name.
> https://docs.oracle.com/javase/8/docs/technotes/guides/net/proxies.html
> Note that starting Nutch with -Dhttp.nonProxyHosts="some.host" has no effect, 
> crawling goes through the proxy anyways.  Only "http.proxy.exception.list" 
> can be used with Nutch.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to