This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
from bd8c847 NUTCH-2386 BasicURLNormalizer does not encode curly braces
add e5a11a9 NUTCH-1932 Automatically remove orphaned pages - apply Markus
Jelsma's latest patch, 2016-06-30 - add method orphanedScore(Text, CrawlDatum)
to ScoringFilter interface - complete unit tests for CrawlDb update
new f356790 Merge pull request #211 from sebastian-nagel/NUTCH-1932
The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
build.xml | 4 +
conf/nutch-default.xml | 28 +++++
default.properties | 1 +
src/java/org/apache/nutch/crawl/CrawlDatum.java | 4 +
src/java/org/apache/nutch/crawl/CrawlDb.java | 1 +
src/java/org/apache/nutch/crawl/CrawlDbFilter.java | 18 ++-
.../org/apache/nutch/crawl/CrawlDbReducer.java | 13 ++-
.../org/apache/nutch/scoring/ScoringFilter.java | 16 ++-
.../org/apache/nutch/scoring/ScoringFilters.java | 8 ++
src/plugin/build.xml | 3 +
.../{scoring-opic => scoring-orphan}/build.xml | 2 +-
.../ivy.xml | 0
.../{scoring-opic => scoring-orphan}/plugin.xml | 19 ++-
.../nutch/scoring/orphan/OrphanScoringFilter.java | 107 +++++++++++++++++
.../apache/nutch/scoring/orphan}/package-info.java | 5 +-
.../scoring/orphan/TestOrphanScoringFilter.java | 128 +++++++++++++++++++++
.../org/apache/nutch/crawl/TestCrawlDbStates.java | 10 +-
17 files changed, 341 insertions(+), 26 deletions(-)
copy src/plugin/{scoring-opic => scoring-orphan}/build.xml (95%)
copy src/plugin/{urlnormalizer-regex => scoring-orphan}/ivy.xml (100%)
copy src/plugin/{scoring-opic => scoring-orphan}/plugin.xml (72%)
create mode 100644
src/plugin/scoring-orphan/src/java/org/apache/nutch/scoring/orphan/OrphanScoringFilter.java
copy src/{java/org/apache/nutch/scoring =>
plugin/scoring-orphan/src/java/org/apache/nutch/scoring/orphan}/package-info.java
(83%)
create mode 100644
src/plugin/scoring-orphan/src/test/org/apache/nutch/scoring/orphan/TestOrphanScoringFilter.java
--
To stop receiving notification emails like this one, please contact
['"[email protected]" <[email protected]>'].