[ https://issues.apache.org/jira/browse/NUTCH-2189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15069422#comment-15069422 ]
Markus Jelsma commented on NUTCH-2189:
--------------------------------------
Hello Sebastian, I do not think so, actually. If the domainblacklist has no
entries, it passes everything. Only hosts or domains that are listed in
the blacklist are filtered out.
> Domain filter must deactivate if no rules are present
> -----------------------------------------------------
>
> Key: NUTCH-2189
> URL: https://issues.apache.org/jira/browse/NUTCH-2189
> Project: Nutch
> Issue Type: Bug
> Reporter: Markus Jelsma
> Assignee: Markus Jelsma
> Attachments: NUTCH-2189.patch
>
>
> We just erased an entire CrawlDB by accident due to a misconfiguration and
> the nice fact that the domain filter deletes everything if it has no rules.
> This issue will deactivate the filter if no rules are present, because it
> makes no sense to configure it without any rules.
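
To illustrate the difference between the two filters discussed here, a minimal sketch follows. It is not the code from NUTCH-2189.patch: the class and method names are hypothetical, it matches on exact host names only, and it assumes the usual URL filter convention of returning the URL to keep it and null to drop it.

```java
import java.net.URI;
import java.util.Locale;
import java.util.Set;

// Hypothetical sketch, not the actual Nutch plugin code.
public class DomainFilterSketch {

    // Returns the lower-cased host of a URL, or null if it cannot be parsed.
    static String hostOf(String url) {
        try {
            String host = URI.create(url).getHost();
            return host == null ? null : host.toLowerCase(Locale.ROOT);
        } catch (IllegalArgumentException e) {
            return null;
        }
    }

    // Whitelist-style domain filter: keeps only listed hosts.
    // With no rules it deactivates itself and passes every URL,
    // instead of rejecting everything.
    static String whitelistFilter(Set<String> allowed, String url) {
        if (allowed.isEmpty()) {
            return url; // no rules configured: filter is inactive
        }
        String host = hostOf(url);
        return host != null && allowed.contains(host) ? url : null;
    }

    // Blacklist-style filter: drops only listed hosts, so an empty
    // rule set already passes everything without a special case.
    static String blacklistFilter(Set<String> blocked, String url) {
        String host = hostOf(url);
        return host != null && blocked.contains(host) ? null : url;
    }
}
```

Without the isEmpty() check, the whitelist variant would return null for every URL when no rules are loaded, which is the behavior that emptied the CrawlDB. The blacklist variant needs no such guard.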
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)