[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lewis John McGibbney updated NUTCH-208:
---------------------------------------

    Attachment: NUTCH-208-branch-1.4-20110210-v3.patch

Patch for branch 1.4. Steps to patch and test:

* download the patch into $NUTCH_HOME
* cd $NUTCH_HOME; patch -p0 -i NUTCH-208-branch-1.4-20110210-v3.patch
* ant compile

All classes should compile successfully, as follows:

{code}
compile:

jar:

deps-test:

deploy:

copy-generated-lib:

deploy:

copy-generated-lib:

compile:

BUILD SUCCESSFUL
Total time: 19 seconds
{code}

After this, test with the following tutorial [1]. Note that you must add URLs to http.proxy.exception.list and restart the tinyproxy server before the functionality can be tested.

[1] http://wiki.apache.org/nutch/SetupProxyForNutch

I understand that this is not an urgent requirement for either users or devs, but I thought it a good way to get a patch submitted and to further my knowledge of classes with which I was previously unfamiliar.

> http: proxy exception list:
> ---------------------------
>
>                 Key: NUTCH-208
>                 URL: https://issues.apache.org/jira/browse/NUTCH-208
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>    Affects Versions: 0.8, 1.3
>            Reporter: Matthias Günter
>            Assignee: Lewis John McGibbney
>            Priority: Trivial
>              Labels: patch
>             Fix For: 1.4, 2.0
>
>         Attachments: NUTCH-208-branch-1.4-20110210-v3.patch, NUTCH-208-branch-1.4-20110807.patch, NUTCH-208-branch-1.4-20110809-v2.patch, NUTCH-208-trunk-2.0-20110810.patch, patch.txt, patch.txt, proxy_exception_list-0.8.diff
>
>
> I suggest that a parameter is added to nutch-default.xml which allows a proxy exception list to be defined.
> <property>
>   <name>http.proxy.exception.list</name>
>   <value></value>
>   <description>URL's and hosts that don't use the proxy (e.g. intranets)</description>
> </property>
> This is useful when scanning intranet/internet combinations from behind a firewall. A preliminary patch is attached to this request, showing the changes.
> We will test it and update it if necessary. This also reflects the reality in web browsers, where in most cases there is an exception list.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
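
For readers unfamiliar with the request, the behaviour asked for in the issue can be sketched roughly as below. This is not the code from any of the attached patches: the class name `ProxyExceptionList`, its methods, and the assumption that `http.proxy.exception.list` holds a comma-separated list of host names are all hypothetical and only illustrate the idea of bypassing the proxy for listed hosts.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper, not taken from the NUTCH-208 patches.
public class ProxyExceptionList {
    private final Set<String> hosts = new HashSet<>();

    // Assumption: the property value is a comma-separated list of host names.
    public ProxyExceptionList(String propertyValue) {
        if (propertyValue == null) {
            return;
        }
        for (String entry : propertyValue.split(",")) {
            String host = entry.trim().toLowerCase(Locale.ROOT);
            if (!host.isEmpty()) {
                hosts.add(host);
            }
        }
    }

    // Returns true when the request for this URL should go through the proxy.
    public boolean useProxy(String url) {
        try {
            String host = new URI(url).getHost();
            if (host == null) {
                return true; // no host component: fall back to the proxy
            }
            return !hosts.contains(host.toLowerCase(Locale.ROOT));
        } catch (URISyntaxException e) {
            return true; // unparsable URL: fall back to the proxy
        }
    }

    public static void main(String[] args) {
        // Example host names are placeholders.
        ProxyExceptionList list =
            new ProxyExceptionList("intranet.example.com, wiki.example.com");
        System.out.println(list.useProxy("http://intranet.example.com/page"));
        System.out.println(list.useProxy("http://www.apache.org/"));
    }
}
```

With an empty property value, as in the default shown in the issue, the set stays empty and every URL would continue to go through the proxy, so in this sketch the feature is opt-in.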