This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
from 33922fe NUTCH-2694 HostDB to aggregate by long instead of integer
new 3abe7db NUTCH-2695: fix some alerts raised by LGTM
  - `Wrong NaN comparison` in Generator
  - `Type mismatch on container modification` in NutchServerPoolExecutor
  - `Missing format argument` in CrawlDbReader
new 31ecf64 NUTCH-2695: fix some alerts raised by LGTM
  - useless checks for null in OutlinkExtractor, protocol-http, ParsePluginsReader, parse-html, parse-tika
  - called LOG.equals(...) instead of LOG.error(...) in ArcRecordReader
  - constant loop condition in protocol-http
  - implicit conversion from array to string in CrawlDbReducer
  - `Missing format argument` in log methods
  - spurious Javadoc @param tags
new fd31cea Merge branch 'NUTCH-2695', closes #438
The 2848 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
 src/java/org/apache/nutch/crawl/CrawlDbReader.java |  2 +-
 .../org/apache/nutch/crawl/CrawlDbReducer.java     |  4 +-
 src/java/org/apache/nutch/crawl/Generator.java     |  2 +-
 .../apache/nutch/hostdb/UpdateHostDbMapper.java    |  3 --
 .../apache/nutch/hostdb/UpdateHostDbReducer.java   |  2 -
 .../nutch/indexer/IndexingFiltersChecker.java      |  2 +-
 .../org/apache/nutch/parse/OutlinkExtractor.java   |  2 +-
 src/java/org/apache/nutch/parse/ParseData.java     | 18 ++-------
 .../org/apache/nutch/parse/ParsePluginsReader.java |  2 +-
 .../org/apache/nutch/segment/SegmentMerger.java    |  4 +-
 .../org/apache/nutch/service/impl/LinkReader.java  |  8 ++--
 .../org/apache/nutch/service/impl/NodeReader.java  |  8 ++--
 .../service/impl/NutchServerPoolExecutor.java      |  2 +-
 .../apache/nutch/service/impl/SequenceReader.java  |  8 ++--
 .../org/apache/nutch/tools/arc/ArcInputFormat.java |  4 +-
 .../apache/nutch/tools/arc/ArcRecordReader.java    |  2 +-
 .../apache/nutch/tools/arc/ArcSegmentCreator.java  |  4 +-
 .../org/apache/nutch/util/EncodingDetector.java    |  6 +--
 src/java/org/apache/nutch/util/MimeUtil.java       |  3 +-
 .../apache/nutch/parse/html/HTMLMetaProcessor.java | 45 ++++++++++------------
 .../apache/nutch/parse/tika/HTMLMetaProcessor.java | 45 ++++++++++------------
 .../apache/nutch/protocol/http/HttpResponse.java   |  4 +-
 .../nutch/scoring/orphan/OrphanScoringFilter.java  | 12 ++++--
 23 files changed, 87 insertions(+), 105 deletions(-)