This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git.
from 9139d6e Merge pull request #526 from sebastian-nagel/NUTCH-2419-urlfilter-rule-file-precedence new f0e1e3d NUTCH-2720 ROBOTS metatag ignored when capitalized new aa3a2a6 NUTCH-2720 ROBOTS metatag ignored when capitalized - move string "robots" to constant in metadata.Nutch - make string lowercase not depend on system locale new 1cb64df Merge pull request #528 from sebastian-nagel/NUTCH-2720 The 3083 revisions listed above as "new" are entirely new to this repository and will be described in separate emails. The revisions listed as "add" were already present in the repository and have only been added to this reference. Summary of changes: .../org/apache/nutch/indexer/IndexerMapReduce.java | 8 +++++--- src/java/org/apache/nutch/metadata/Nutch.java | 6 ++++++ .../apache/nutch/parse/html/HTMLMetaProcessor.java | 3 ++- .../apache/nutch/parse/tika/HTMLMetaProcessor.java | 23 +++++++++++++--------- .../org/apache/nutch/parse/tika/TikaParser.java | 8 +++++++- 5 files changed, 34 insertions(+), 14 deletions(-)