This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git
from 312828602 NUTCH-2976 SitemapProcessor: verify sitemap values
added from sitemap to CrawlDB (priority, modification time and
change frequency) - use default priority if priority <= 0.0 (a CrawlDatum
with score 0.0 is not eligible for fetch) - ensure that the fetch interval
(from change frequency) is within db.fetch.schedule.adaptive.min_interval
and db.fetch.schedule.adaptive.max_interval - ignore last-modified times in the
future
add d2adde295 Updated Arbitrary Indexer that passes all indexer
constructor args to user's POJO instance.
add 1834c89a3 Corrected element order on a couple nutch-default.xml nodes
for index-arbitrary configs.
add 154504ba2 Added Apache license to source of toy class used by JUnit
test.
add 94a993528 NUTCH-3115 update to set all fields access on each POJO
individually, updated JUnit tests, improved logging
new e85001205 Merge pull request #856 from CatChullain/NUTCH-3115
The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
conf/nutch-default.xml | 10 ++
.../indexer/arbitrary/ArbitraryIndexingFilter.java | 46 ++++++--
.../nutch/indexer/arbitrary/PopularityGauge.java | 86 ++++++++++++++
.../arbitrary/TestArbitraryIndexingFilter.java | 125 ++++++++++++++++++++-
.../arbitrary/{Echo.java => UpdatedEcho.java} | 33 ++++--
.../{Multiplier.java => UpdatedMultiplier.java} | 29 +++--
6 files changed, 305 insertions(+), 24 deletions(-)
create mode 100644
src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/PopularityGauge.java
copy
src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/{Echo.java
=> UpdatedEcho.java} (59%)
copy
src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/{Multiplier.java
=> UpdatedMultiplier.java} (71%)