This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
    from 24ac4a0  Merge pull request #344 from sebastian-nagel/NUTCH-2574-generator-max-count
     add 4a69dbc  NUTCH-2597: fixed cleanup()
     add 602ff0b  Fixes a few cleanup()'s and some cleanup of code
     new 39da8c1  Merge pull request #349 from sju/NUTCH-2597
The 1 revision listed above as "new" is entirely new to this
repository and will be described in a separate email. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
 src/java/org/apache/nutch/crawl/CrawlDbReader.java | 62 ++++++++++------------
 src/java/org/apache/nutch/crawl/LinkDb.java        |  3 --
 src/java/org/apache/nutch/crawl/LinkDbReader.java  |  4 +-
 .../apache/nutch/hostdb/UpdateHostDbMapper.java    |  4 +-
 .../apache/nutch/hostdb/UpdateHostDbReducer.java   |  5 +-
 src/java/org/apache/nutch/indexer/CleaningJob.java | 11 ++--
 .../apache/nutch/scoring/webgraph/LinkDumper.java  |  3 --
 7 files changed, 41 insertions(+), 51 deletions(-)