This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
from 9139d6e Merge pull request #526 from
sebastian-nagel/NUTCH-2419-urlfilter-rule-file-precedence
new f0e1e3d NUTCH-2720 ROBOTS metatag ignored when capitalized
new aa3a2a6 NUTCH-2720 ROBOTS metatag ignored when capitalized - move
string "robots" to constant in metadata.Nutch - make string lowercase not
depend on system locale
new 1cb64df Merge pull request #528 from sebastian-nagel/NUTCH-2720
The 3083 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
.../org/apache/nutch/indexer/IndexerMapReduce.java | 8 +++++---
src/java/org/apache/nutch/metadata/Nutch.java | 6 ++++++
.../apache/nutch/parse/html/HTMLMetaProcessor.java | 3 ++-
.../apache/nutch/parse/tika/HTMLMetaProcessor.java | 23 +++++++++++++---------
.../org/apache/nutch/parse/tika/TikaParser.java | 8 +++++++-
5 files changed, 34 insertions(+), 14 deletions(-)