This is an automated email from the ASF dual-hosted git repository.
lewismc pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
from ee8b1ef Added Furkan KAMACI as developer.
adds 61985f1 NUTCH-2372 Fixing the errors in documentation
new e9b823d Merge pull request #182 from Omkar20895/NUTCH-2372
The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "adds" were already present in the repository and have only
been added to this reference.
Summary of changes:
src/java/org/apache/nutch/crawl/FetchSchedule.java | 4 +--
src/java/org/apache/nutch/crawl/Generator.java | 2 +-
src/java/org/apache/nutch/crawl/Injector.java | 13 +++----
src/java/org/apache/nutch/hostdb/ReadHostDb.java | 2 +-
.../apache/nutch/hostdb/UpdateHostDbMapper.java | 14 ++++----
.../apache/nutch/hostdb/UpdateHostDbReducer.java | 6 ++--
src/java/org/apache/nutch/net/URLNormalizers.java | 4 +--
src/java/org/apache/nutch/parse/ParseResult.java | 2 +-
src/java/org/apache/nutch/parse/ParserChecker.java | 2 +-
.../org/apache/nutch/plugin/PluginRepository.java | 2 +-
.../org/apache/nutch/segment/SegmentMerger.java | 4 ---
src/java/org/apache/nutch/segment/SegmentPart.java | 2 +-
.../apache/nutch/service/impl/ConfManagerImpl.java | 2 +-
.../nutch/tools/AbstractCommonCrawlFormat.java | 2 +-
.../apache/nutch/tools/CommonCrawlDataDumper.java | 6 ++--
.../org/apache/nutch/tools/CommonCrawlFormat.java | 4 +--
.../nutch/tools/CommonCrawlFormatFactory.java | 6 ++--
.../nutch/tools/CommonCrawlFormatSimple.java | 3 +-
src/java/org/apache/nutch/tools/FileDumper.java | 6 +---
.../apache/nutch/tools/arc/ArcRecordReader.java | 11 +++---
.../org/apache/nutch/util/EncodingDetector.java | 5 ++-
src/java/org/apache/nutch/util/LockUtil.java | 4 +--
src/java/org/apache/nutch/util/MimeUtil.java | 4 +--
.../org/apache/nutch/util/PrefixStringMatcher.java | 8 ++---
.../org/apache/nutch/util/SuffixStringMatcher.java | 8 ++---
src/java/org/apache/nutch/util/TableUtil.java | 4 +--
src/java/org/apache/nutch/util/TimingUtil.java | 2 +-
.../org/apache/nutch/util/TrieStringMatcher.java | 8 ++---
src/java/org/apache/nutch/util/URLUtil.java | 41 ++++++----------------
.../nutch/indexer/feed/FeedIndexingFilter.java | 2 +-
.../nutch/indexer/anchor/AnchorIndexingFilter.java | 2 +-
.../nutch/indexer/geoip/GeoIPIndexingFilter.java | 3 --
.../nutch/indexer/links/LinksIndexingFilter.java | 24 ++++++-------
.../nutch/indexer/metadata/MetadataIndexer.java | 2 +-
.../nutch/indexer/replace/ReplaceIndexer.java | 18 +++++-----
.../nutch/indexwriter/dummy/DummyIndexWriter.java | 2 +-
.../apache/nutch/indexwriter/solr/SolrUtils.java | 2 +-
.../nutch/analysis/lang/HTMLLanguageParser.java | 3 +-
.../nutch/protocol/htmlunit/HtmlUnitWebDriver.java | 2 +-
.../nutch/urlfilter/api/RegexURLFilterBase.java | 4 +--
.../nutch/protocol/selenium/HttpWebClient.java | 2 +-
.../indexer/filter/MimeTypeIndexingFilter.java | 3 +-
.../apache/nutch/parse/zip/ZipTextExtractor.java | 2 +-
.../java/org/apache/nutch/protocol/ftp/Client.java | 2 +-
.../httpclient/HttpBasicAuthentication.java | 4 +--
.../nutch/scoring/opic/OPICScoringFilter.java | 5 ++-
.../scoring/similarity/util/LuceneTokenizer.java | 2 +-
.../apache/nutch/collection/CollectionManager.java | 2 +-
.../subcollection/SubcollectionIndexingFilter.java | 3 +-
.../nutch/urlfilter/domain/DomainURLFilter.java | 10 +++---
.../domainblacklist/DomainBlacklistURLFilter.java | 10 +++---
.../nutch/urlfilter/suffix/SuffixURLFilter.java | 7 ++--
.../indexer/urlmeta/URLMetaIndexingFilter.java | 6 ++--
.../scoring/urlmeta/URLMetaScoringFilter.java | 2 +-
.../querystring/QuerystringURLNormalizer.java | 2 +-
55 files changed, 130 insertions(+), 177 deletions(-)
--
To stop receiving notification emails like this one, please contact
['"[email protected]" <[email protected]>'].