[nutch] 02/14: NUTCH-2630 Fetcher to log skipped records by robots.txt - change required log level to INFO (default) for messages reporting skipped URLs because of robots.txt rules (disallow or crawl

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 524a59480a3e258a0363faf343fa57875f8f9ea8 Author: Sebastian Nagel AuthorDate: Mon Oct 8 14:50:51 2018 +0200

[nutch] 14/14: NUTCH-1842: crawl.gen.delay value is read incorrectly from config Merge pull request #393 from YossiTamari/patch-2

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit f861c8203c8544b91e061964441485bd2f6de145 Merge: 8151237 e6a961c Author: Sebastian Nagel AuthorDate: Thu Nov 15 11:17:37

[nutch] 09/14: NUTCH-2651 Upgrade to Tika 1.19.1 (from 1.18) - modified work-around to fix downloading of dependency javax.ws.rs-api-*.jar: define property packaging.type in ivysettings.xml

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 31a1ec4bab4a702fa8876926d54b212cc40acbce Author: Sebastian Nagel AuthorDate: Sun Oct 21 20:49:51 2018 +0200

[nutch] 01/14: NUTCH-2625 ProtocolFactory.getProtocol(url) may create multiple plugin instances - lock critical block (conditional creation of plugin instance) on object cache object

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit a6f533dfecd688a6c43212b0e826be9a2da5b4ce Author: Sebastian Nagel AuthorDate: Tue Jul 24 16:19:04 2018 +0200

[nutch] 12/14: NUTCH-2671 Upgrade to ant ivy library - fix order of ant target dependencies: "compile-core" must come before "resolve-test"

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 393d3e5f96c0f381b904e17e5abcad695f911e5e Author: Sebastian Nagel AuthorDate: Tue Oct 30 16:45:22 2018 +0100

[nutch] 03/14: NUTCH-2651 Upgrade core and parse-tika to use Tika 1.19.1 - add work-around to fix downloading of dependency javax.ws.rs-api-*.jar (need to set property packaging.type=jar)

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 2a3b1d15fdebe7ada325b9b955c164270a21e127 Author: Sebastian Nagel AuthorDate: Fri Oct 12 13:47:43 2018 +0200

[nutch] 13/14: NUTCH-2671 Upgrade to ant ivy library - roll back to 2.4.0 to bring Jenkins build back to normal

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit e6a961ce967e94dc7128154b68cfa24fcd4370e9 Author: Sebastian Nagel AuthorDate: Tue Oct 30 17:47:22 2018 +0100

[nutch] 05/14: NUTCH-2655 Update Solr schema.xml for Solr 7.x - add required field types to schema.xml

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit a9ea1f1012f6d1b4296d4728b00cf7498aa05dba Author: Sebastian Nagel AuthorDate: Mon Oct 15 15:04:01 2018 +0200

[nutch] 06/14: NUTCH-2659 Add missing Apache license headers

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 48e1aef83b94468c9f839cf28b24560bef233780 Author: Sebastian Nagel AuthorDate: Wed Oct 17 14:23:44 2018 +0200

[nutch] branch master updated (8151237 -> f861c82)

2018-11-15 Thread snagel
snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 8151237 Merge pull request #387 from sebastian-nagel/NUTCH-2630-fetcher-log-robotstxt-denied add 8b7298d

[nutch] 10/14: NUTCH-2658 Adding the fields required by the index-links plugin to the schema

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit a5df63a3d644e90fb881a0f16c8f29d9320d1de3 Author: Jorge Luis Betancourt AuthorDate: Tue Oct 23 22:57:03 2018 +0200

[nutch] 07/14: NUTCH-2660 Plugin tests not executed - add missing unit test packages to plugin build.xml - tests of "headings" plugin depend on "lib-nekohtml" - add "protocol-okhttp" to Javadoc API ov

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit d45fb7a659ba29371f171817a6a6de72965189c3 Author: Sebastian Nagel AuthorDate: Wed Oct 17 14:36:58 2018 +0200

[nutch] 08/14: NUTCH-2661 Move the TestOutlinks class into the o.a.n.parse path

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 2d48152db0d032a58ea2324e8b40b6c5c48d7cd6 Author: Jorge Luis Betancourt Gonzalez AuthorDate: Wed Oct 17 18:07:51 2018

[nutch] 11/14: NUTCH-2671 Upgrade to ant ivy library - upgrade to 2.5.0-rc1 to address NUTCH-2669

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 93b1a8174254de83232be12ac18d99ca4fa83518 Author: Sebastian Nagel AuthorDate: Mon Oct 29 13:41:42 2018 +0100

[nutch] 04/14: NUTCH-2652 Fetcher launches more fetch tasks than fetch lists - properly override method getSplits(...) of FileInputFormat

2018-11-15 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 89b16ce29f3bf6618ec2bf9df0807b24c1e40339 Author: Sebastian Nagel AuthorDate: Mon Oct 15 13:44:20 2018 +0200