[ 
https://issues.apache.org/jira/browse/NUTCH-2189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15069422#comment-15069422
 ] 

Markus Jelsma commented on NUTCH-2189:
--------------------------------------

Hello Sebastian - i do not think so actually. If the domainblacklist lacks 
entries, it passes everything. Only hosts or domains that are listed in 
blacklist are filtered out.

> Domain filter must deactivate if no rules are present
> -----------------------------------------------------
>
>                 Key: NUTCH-2189
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2189
>             Project: Nutch
>          Issue Type: Bug
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>         Attachments: NUTCH-2189.patch
>
>
> We just erased an entire CrawlDB by accident due to a misconfiguration and 
> the nice fact that the domain filter deletes everything if it has no rules. 
> This issue will deactivate the filter if no rules are present, because it 
> makes no sense to configure it without any rules.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to