Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "SetupProxyForNutch" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/SetupProxyForNutch?action=diff&rev1=13&rev2=14 If necessary these will act as a blacklist, because of FilterDefaultDeny No. This property changes the default policy of the filtering system. If this directive is commented out, or is set to "No" then the default policy is to allow everything which is not specifically denied by the filter file. However, by setting this directive to "Yes" the default policy becomes to deny everything which is _not_ specifically allowed by the filter file e.g. the inverse. + + Tinyproxy supports filtering of web sites based on URLs or domains. We need to specify the location of a text file containing the filter rules, one rule per line. This can be done as follows {{{ - sudo vi /etc/filter + sudo mkdir /etc/filter + sudo cat filters.txt }}} and add sites urls to be blocked. The list should comprise of single URLs, one per line. {{{

