This is an automated email from the ASF dual-hosted git repository.

snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.


    from 9139d6e  Merge pull request #526 from 
sebastian-nagel/NUTCH-2419-urlfilter-rule-file-precedence
     new f0e1e3d  NUTCH-2720 ROBOTS metatag ignored when capitalized
     new aa3a2a6  NUTCH-2720 ROBOTS metatag ignored when capitalized - move 
string "robots" to constant in metadata.Nutch - make string lowercase not 
depend on system locale
     new 1cb64df  Merge pull request #528 from sebastian-nagel/NUTCH-2720

The 3083 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 .../org/apache/nutch/indexer/IndexerMapReduce.java |  8 +++++---
 src/java/org/apache/nutch/metadata/Nutch.java      |  6 ++++++
 .../apache/nutch/parse/html/HTMLMetaProcessor.java |  3 ++-
 .../apache/nutch/parse/tika/HTMLMetaProcessor.java | 23 +++++++++++++---------
 .../org/apache/nutch/parse/tika/TikaParser.java    |  8 +++++++-
 5 files changed, 34 insertions(+), 14 deletions(-)

Reply via email to