This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
    from 8151237 Merge pull request #387 from
                 sebastian-nagel/NUTCH-2630-fetcher-log-robotstxt-denied
     add 8b7298d NUTCH-1842: crawl.gen.delay value is read incorrectly from
                 configuration.
     new a6f533d NUTCH-2625 ProtocolFactory.getProtocol(url) may create
                 multiple plugin instances - lock critical block (conditional
                 creation of plugin instance) on object cache object
     new 524a594 NUTCH-2630 Fetcher to log skipped records by robots.txt -
                 change required log level to INFO (default) for messages
                 reporting skipped URLs because of robots.txt rules (disallow
                 or crawl delay larger than fetcher.max.crawl.delay)
     new 2a3b1d1 NUTCH-2651 Upgrade core and parse-tika to use Tika 1.19.1 -
                 add work-around to fix downloading of dependency
                 javax.ws.rs-api-*.jar (need to set property
                 packaging.type=jar)
     new 89b16ce NUTCH-2652 Fetcher launches more fetch tasks than fetch lists
                 - properly override method getSplits(...) of FileInputFormat
     new a9ea1f1 NUTCH-2655 Update Solr schema.xml for Solr 7.x - add required
                 field types to schema.xml
     new 48e1aef NUTCH-2659 Add missing Apache license headers
     new d45fb7a NUTCH-2660 Plugin tests not executed - add missing unit test
                 packages to plugin build.xml - tests of "headings" plugin
                 depend on "lib-nekohtml" - add "protocol-okhttp" to Javadoc
                 API overview - add missing test packages to ant "eclipse"
                 target
     new 2d48152 NUTCH-2661 Move the TestOutlinks class into the o.a.n.parse
                 path
     new 31a1ec4 NUTCH-2651 Upgrade to Tika 1.19.1 (from 1.18) - modified
                 work-around to fix downloading of dependency
                 javax.ws.rs-api-*.jar: define property packaging.type in
                 ivysettings.xml
     new a5df63a NUTCH-2658 Adding the fields required by the index-links
                 plugin to the schema
     new 93b1a81 NUTCH-2671 Upgrade to ant ivy library - upgrade to 2.5.0-rc1
                 to address NUTCH-2669
     new 393d3e5 NUTCH-2671 Upgrade to ant ivy library - fix order of ant
                 target dependencies: "compile-core" must come before
                 "resolve-test"
     new e6a961c NUTCH-2671 Upgrade to ant ivy library - roll back to 2.4.0 to
                 bring Jenkins build back to normal
     new f861c82 NUTCH-1842: crawl.gen.delay value is read incorrectly from
                 config Merge pull request #393 from YossiTamari/patch-2
The 14 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
src/java/org/apache/nutch/crawl/Generator.java | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)