This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
    from e61a8a3  Merge pull request #525 from sebastian-nagel/NUTCH-1945
     new b543b8b  NUTCH-2419 Some URL filters and normalizers do not respect command-line override for rule file
     new f971ca1  NUTCH-2419 Some URL filters and normalizers do not respect command-line override for rule file
     new 9139d6e  Merge pull request #526 from sebastian-nagel/NUTCH-2419-urlfilter-rule-file-precedence
The 3080 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
 .../nutch/parsefilter/regex/RegexParseFilter.java | 18 +-----
 .../parsefilter/regex/TestRegexParseFilter.java   |  6 +-
 .../nutch/urlfilter/domain/DomainURLFilter.java   | 63 ++++++--------------
 .../urlfilter/domain/TestDomainURLFilter.java     |  6 +-
 .../domainblacklist/DomainBlacklistURLFilter.java | 68 +++++++---------------
 .../TestDomainBlacklistURLFilter.java             |  4 +-
 .../nutch/urlfilter/prefix/PrefixURLFilter.java   | 39 ++++++-------
 .../nutch/urlfilter/suffix/SuffixURLFilter.java   | 38 +++++-------
 .../net/urlnormalizer/host/HostURLNormalizer.java | 24 +++-----
 .../urlnormalizer/host/TestHostURLNormalizer.java |  3 +-
 .../protocol/ProtocolURLNormalizer.java           | 24 +++-----
 .../protocol/TestProtocolURLNormalizer.java       |  3 +-
 .../urlnormalizer/slash/SlashURLNormalizer.java   | 26 +++------
 .../slash/TestSlashURLNormalizer.java             |  3 +-
 14 files changed, 114 insertions(+), 211 deletions(-)