This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
    from 25d0cf7  Merge pull request #139 from bmzhao/NUTCH-2296-elasticsearch-rest-indexing
    adds faed27a  NUTCH-2281 Support non-default FileSystem
    adds d3ab941  Create temporary output as subdirectory of the final output (CrawlDb, LinkDb) to avoid failing cross-filesystem moves `FileSystem.rename(...)`
    adds 3305321  Adapt NUTCH-2336 to NUTCH-2281
    adds 5dcd7b1  NUTCH-2281 Support non-default file system - fix install of CrawlDb for mergedb (temporary output now written to subdir of final output directory) - uniformly lock and install CrawlDb
     new f046e63  Merge branch 'sebastian-nagel/nutch:NUTCH-2281', fixes NUTCH-2281, closes #119
The 1 revision listed above as "new" is entirely new to this
repository and will be described in a separate email. The revisions
listed as "adds" were already present in the repository and have only
been added to this reference.
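
Revision d3ab941 above describes writing temporary output into a subdirectory of the final output so that the closing rename never has to cross filesystems. The following is a minimal sketch of that idea only, not the Nutch code: it assumes local paths and uses `java.nio.file.Files.move` with `ATOMIC_MOVE` as a stand-in for Hadoop's `FileSystem.rename(...)`. The class name, the helper `writeAndInstall`, the `tmp-` prefix, the `part-00000` file name, and the `current`/`old` directory names are all chosen for illustration.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class TempSubdirInstall {

    /**
     * Hypothetical helper: writes output into a temporary subdirectory of
     * finalDir, then renames it into place as finalDir/current.
     */
    public static Path writeAndInstall(Path finalDir, String content) throws IOException {
        Files.createDirectories(finalDir);

        // The temporary directory is a child of the final output directory,
        // so both are guaranteed to live on the same filesystem.
        Path tmp = Files.createTempDirectory(finalDir, "tmp-");
        Files.write(tmp.resolve("part-00000"), content.getBytes(StandardCharsets.UTF_8));

        Path current = finalDir.resolve("current");
        Path old = finalDir.resolve("old");
        if (Files.exists(current)) {
            // Keep the previous version as "old" (illustrative rotation scheme).
            deleteRecursively(old);
            Files.move(current, old, StandardCopyOption.ATOMIC_MOVE);
        }
        // ATOMIC_MOVE would fail if source and target were on different
        // filesystems; that cannot happen here because tmp is inside finalDir.
        Files.move(tmp, current, StandardCopyOption.ATOMIC_MOVE);
        return current;
    }

    /** Deletes a directory tree if it exists, children before parents. */
    public static void deleteRecursively(Path dir) throws IOException {
        if (!Files.exists(dir)) {
            return;
        }
        List<Path> paths;
        try (Stream<Path> walk = Files.walk(dir)) {
            paths = walk.sorted(Comparator.reverseOrder()).collect(Collectors.toList());
        }
        for (Path p : paths) {
            Files.delete(p);
        }
    }

    public static void main(String[] args) throws IOException {
        Path db = Files.createTempDirectory("demo").resolve("crawldb");
        writeAndInstall(db, "first run");
        Path current = writeAndInstall(db, "second run");
        byte[] data = Files.readAllBytes(current.resolve("part-00000"));
        System.out.println(new String(data, StandardCharsets.UTF_8)); // prints: second run
    }
}
```

Had the temporary directory been created under a system-wide temp location instead, the final move could land on a different filesystem and fail, which is the situation the commit message says it avoids.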
Summary of changes:
 src/java/org/apache/nutch/crawl/CrawlDb.java       | 62 +++++++++++-----------
 src/java/org/apache/nutch/crawl/CrawlDbMerger.java | 27 ++++++----
 src/java/org/apache/nutch/crawl/CrawlDbReader.java | 10 ++--
 .../org/apache/nutch/crawl/DeduplicationJob.java   |  2 +-
 src/java/org/apache/nutch/crawl/Generator.java     | 23 ++++----
 src/java/org/apache/nutch/crawl/Injector.java      |  7 ++-
 src/java/org/apache/nutch/crawl/LinkDb.java        | 19 +++----
 src/java/org/apache/nutch/crawl/LinkDbMerger.java  |  6 +--
 src/java/org/apache/nutch/crawl/LinkDbReader.java  |  2 +-
 src/java/org/apache/nutch/hostdb/UpdateHostDb.java |  2 +-
 .../org/apache/nutch/indexer/IndexerMapReduce.java |  2 +-
 src/java/org/apache/nutch/indexer/IndexingJob.java |  2 +-
 src/java/org/apache/nutch/parse/ParseSegment.java  | 10 ++--
 .../apache/nutch/scoring/webgraph/LinkDumper.java  |  4 +-
 .../apache/nutch/scoring/webgraph/LinkRank.java    |  2 +-
 .../apache/nutch/scoring/webgraph/NodeReader.java  |  3 +-
 .../nutch/scoring/webgraph/ScoreUpdater.java       |  2 +-
 .../apache/nutch/scoring/webgraph/WebGraph.java    |  7 +--
 .../org/apache/nutch/segment/SegmentMerger.java    |  7 +--
 .../org/apache/nutch/segment/SegmentReader.java    | 28 +++-------
 .../apache/nutch/tools/CommonCrawlDataDumper.java  |  4 +-
 src/java/org/apache/nutch/tools/FileDumper.java    |  1 -
 src/java/org/apache/nutch/util/LockUtil.java       | 43 +++++++++++++++
 23 files changed, 157 insertions(+), 118 deletions(-)