[nutch] branch master updated (b38077b -> 1130e68)

2018-08-17 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from b38077b NUTCH-2632 protocol-okhttp doesn't accept proxy authentication - merge PR #375 from branch '

[nutch] 01/01: Merge pull request #365 from sebastian-nagel/NUTCH-2621-3rd-party-license-report

2018-08-17 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 1130e684fb1b525b12eebc561b985466edd72168 Merge: b38077b 1f148ba Author: Sebastian Nagel AuthorDate: Fri Aug 17 11:59:50

[nutch] 01/01: Merge pull request #379 from rustyx/NUTCH-2640

2018-09-11 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git commit 7f1b01f979795678e79849af550602c235ba95bc Merge: c43c2c8 fb55479 Author: Sebastian Nagel AuthorDate: Tue Sep 11 10:23:09

[nutch] branch 2.x updated (c43c2c8 -> 7f1b01f)

2018-09-11 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git. from c43c2c8 NUTCH- re-fetch deletes all metadata except _csh_ and _rs_ add 7e45020 A fix for NUTCH-2639. Will not

[nutch] branch master updated: NUTCH-2639 bin/nutch fails to set native library path on Cygwin causing jobs to fail with UnsatisfiedLinkError Pick fix contributed by rustyx for 2.x

2018-09-11 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 566f3fb NUTCH-2639 bin/nutch fails to set

[nutch] branch 2.x updated (7f1b01f -> 69e9e92)

2018-09-12 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git. from 7f1b01f Merge pull request #379 from rustyx/NUTCH-2640 add ef140f4 NUTCH-2637 fix number of reducers to run

[nutch] 01/01: Merge pull request #381 from sinsi404/NUTCH-2637

2018-09-12 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git commit 69e9e92bb567191c6b02d30c2751a500d89d0c11 Merge: 7f1b01f ef140f4 Author: Sebastian Nagel AuthorDate: Wed Sep 12 11:06:10

[nutch] 01/01: Merge pull request #386 from sebastian-nagel/NUTCH-2642-index-more-date-timezone-2x

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git commit 6f3cb465ebdffd05ed1260324767a690599f96e6 Merge: 69e9e92 9dc57fb Author: Sebastian Nagel AuthorDate: Sun Oct 7 19:09:32 2018

[nutch] branch master updated (61d7e8c -> 8c55414)

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 61d7e8c NUTCH-2647 Skip TLS certificate checks in protocol-http plugin add d3864d6 NUTCH-2642

[nutch] 01/01: Merge pull request #385 from sebastian-nagel/NUTCH-2642-index-more-date-timezone

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 8c55414f6779cf2d32bf27c31194cd717d16c99f Merge: 61d7e8c d3864d6 Author: Sebastian Nagel AuthorDate: Sun Oct 7 19:09:41

[nutch] branch 2.x updated (69e9e92 -> 6f3cb46)

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git. from 69e9e92 Merge pull request #381 from sinsi404/NUTCH-2637 add 9dc57fb NUTCH-2642 MoreIndexingFilter parses ISO

[nutch] branch 2.x updated (6f3cb46 -> 1e7f12b)

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git. from 6f3cb46 Merge pull request #386 from sebastian-nagel/NUTCH-2642-index-more-date-timezone-2x add 468b707 Fix for

[nutch] 01/01: Merge pull request #380 from rustyx/NUTCH-2641

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git commit 1e7f12b05f4a869b9a3f9ddd02c5d17ee99e83b9 Merge: 6f3cb46 468b707 Author: Sebastian Nagel AuthorDate: Sun Oct 7 20:41:12 2018

[nutch] branch master updated (8c55414 -> ec9e3d8)

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 8c55414 Merge pull request #385 from sebastian-nagel/NUTCH-2642-index-more-date-timezone add 113c58e NUTCH

[nutch] 01/01: Merge pull request #376 from sebastian-nagel/NUTCH-2635-generator-temporary-output

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit ec9e3d8deb481e40914d36077e75c049896f60d5 Merge: 8c55414 113c58e Author: Sebastian Nagel AuthorDate: Sun Oct 7 20:44:10

[nutch] branch master updated (ec9e3d8 -> 525e241)

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from ec9e3d8 Merge pull request #376 from sebastian-nagel/NUTCH-2635-generator-temporary-output add 7ed4204 NUTCH

[nutch] 01/01: Merge pull request #382 from sebastian-nagel/NUTCH-2634-ant-resolve-default

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 525e241e86e7cc000224bca0a41519fd777f84b0 Merge: ec9e3d8 7ed4204 Author: Sebastian Nagel AuthorDate: Sun Oct 7 20:52:41

[nutch] branch master updated (525e241 -> 0ce62e1)

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 525e241 Merge pull request #382 from sebastian-nagel/NUTCH-2634-ant-resolve-default add 6f5c50e NUTCH-2644

[nutch] 01/01: Merge pull request #383 from sebastian-nagel/NUTCH-2644-crawldb-reader

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 0ce62e10fb1f107131bff955a44a7105b62521a0 Merge: 525e241 497db00 Author: Sebastian Nagel AuthorDate: Sun Oct 7 21:08:53

[nutch] 01/01: Merge pull request #369 from sebastian-nagel/NUTCH-2623-fetcher-queue-mode

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 4d2938f8c54447b603f4f133133ac755f4ac62b6 Merge: 0ce62e1 d1ffe61 Author: Sebastian Nagel AuthorDate: Sun Oct 7 21:12:08

[nutch] branch master updated (0ce62e1 -> 4d2938f)

2018-10-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 0ce62e1 Merge pull request #383 from sebastian-nagel/NUTCH-2644-crawldb-reader add d1ffe61 NUTCH-2623 Fetcher

[nutch] 01/01: Merge pull request #388 from sebastian-nagel/NUTCH-2648-configurable-tls-cert-check

2018-10-09 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 5fb314027205724cf1ddce63f73641166be838ec Merge: 4d2938f 58ea01f Author: Sebastian Nagel AuthorDate: Tue Oct 9 14:57:33

[nutch] branch master updated (4d2938f -> 5fb3140)

2018-10-09 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 4d2938f Merge pull request #369 from sebastian-nagel/NUTCH-2623-fetcher-queue-mode add 3f64083 NUTCH-2648

[nutch] branch master updated (5fb3140 -> a997f10)

2018-10-13 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 5fb3140 Merge pull request #388 from sebastian-nagel/NUTCH-2648-configurable-tls-cert-check add c532c4e NUTCH

[nutch] 01/01: Merge pull request #389 from sebastian-nagel/NUTCH-2192-remove-oro

2018-10-13 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit a997f102878dc979f2ce542e63e860edeaf65f68 Merge: 5fb3140 4418a0d Author: Sebastian Nagel AuthorDate: Sat Oct 13 11:52:25

[nutch] branch 2.x updated (1e7f12b -> 37d7eee)

2018-10-13 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git. from 1e7f12b Merge pull request #380 from rustyx/NUTCH-2641 add b2d9058 NUTCH-1678 Remove dependency on org.apache.oro

[nutch] 01/01: Merge pull request #390 from sebastian-nagel/NUTCH-2192-remove-oro-2x

2018-10-13 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git commit 37d7eee2541fd5f69a5c0310c7fb1fc01fc32c7f Merge: 1e7f12b b2d9058 Author: Sebastian Nagel AuthorDate: Sat Oct 13 11:53:57

[nutch] 01/01: Merge pull request #368 from sebastian-nagel/NUTCH-2625-protocolfactory-getprotocol-synchronized

2018-10-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 59395791978e4b8e1364ce84a399139b7f3483cb Merge: a997f10 a10db14 Author: Sebastian Nagel AuthorDate: Sat Oct 20 19:13:00

[nutch] branch master updated (a997f10 -> 5939579)

2018-10-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from a997f10 Merge pull request #389 from sebastian-nagel/NUTCH-2192-remove-oro add a10db14 NUTCH-2625

[nutch] branch master updated (5939579 -> 95e9d66)

2018-10-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 5939579 Merge pull request #368 from sebastian-nagel/NUTCH-2625-protocolfactory-getprotocol-synchronized add

[nutch] 01/01: Merge pull request #397 from sebastian-nagel/NUTCH-2660-execute-plugin-tests

2018-10-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 95e9d464851a37233df5adab217d58e7361e Merge: 5939579 2b7dc0f Author: Sebastian Nagel AuthorDate: Sat Oct 20 19:16:11

[nutch] branch master updated (95e9d66 -> 4426ca9)

2018-10-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 95e9d66 Merge pull request #397 from sebastian-nagel/NUTCH-2660-execute-plugin-tests add 24faf03 NUTCH-2651

[nutch] 01/01: Merge pull request #391 from sebastian-nagel/NUTCH-2651-tika-1.19.1

2018-10-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 4426ca99b98ff66a65ce815843b5e00b6b658614 Merge: 95e9d66 24faf03 Author: Sebastian Nagel AuthorDate: Sat Oct 20 19:19:28

[nutch] branch master updated (4426ca9 -> 1abd6dd)

2018-10-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 4426ca9 Merge pull request #391 from sebastian-nagel/NUTCH-2651-tika-1.19.1 add a6de472 NUTCH-2652 Fetcher

[nutch] 01/01: Merge pull request #394 from sebastian-nagel/NUTCH-2652-fetcher-not-split-inputs

2018-10-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 1abd6dd7a3167e996e310349a36e7100a585bf98 Merge: 4426ca9 a6de472 Author: Sebastian Nagel AuthorDate: Sat Oct 20 19:36:11

[nutch] branch master updated: NUTCH-2651 Upgrade to Tika 1.19.1 (from 1.18) - modified work-around to fix downloading of dependency javax.ws.rs-api-*.jar: define property packaging.type in ivysetting

2018-10-21 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 65c4fed NUTCH-2651 Upgrade to Tika 1.19.1 (from

[nutch] 01/01: Merge pull request #406 from sebastian-nagel/NUTCH-2671-ivy-lib-upgrade

2018-10-30 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 426650f6f11d17fe7627d74391f4d596859b9ffe Merge: 5e3de5b ed142e5 Author: Sebastian Nagel AuthorDate: Tue Oct 30 10:52:16

[nutch] branch master updated (5e3de5b -> 426650f)

2018-10-30 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 5e3de5b Merge pull request #396 from sebastian-nagel/NUTCH-2659-license-headers add ed142e5 NUTCH-2671

[nutch] branch 2.x updated (37d7eee -> a0a7be6)

2018-10-30 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git. from 37d7eee Merge pull request #390 from sebastian-nagel/NUTCH-2192-remove-oro-2x add c508f21 NUTCH-2671 Upgrade to

[nutch] 01/01: Merge pull request #405 from sebastian-nagel/NUTCH-2671-ivy-lib-upgrade-2x

2018-10-30 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git commit a0a7be69e79a02355d4ba83def9594b2e54b9934 Merge: 37d7eee c508f21 Author: Sebastian Nagel AuthorDate: Tue Oct 30 10:52:21

[nutch] branch master updated: NUTCH-2671 Upgrade to ant ivy library - fix order of ant target dependencies: "compile-core" must come before "resolve-test"

2018-10-30 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 9a89898 NUTCH-2671 Upgrade to ant ivy library

[nutch] branch master updated: NUTCH-2671 Upgrade to ant ivy library - roll back to 2.4.0 to bring Jenkins build back to normal

2018-10-30 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 73bb0a7 NUTCH-2671 Upgrade to ant ivy library

[nutch] branch 2.x updated: NUTCH-2671 Upgrade to ant ivy library - roll back to 2.4.0 to bring Jenkins build back to normal

2018-10-30 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/2.x by this push: new 855e650 NUTCH-2671 Upgrade to ant ivy library - roll

[nutch] 01/01: Merge pull request #395 from sebastian-nagel/NUTCH-2655-solr-schema-7x

2018-11-14 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit f443f1b4f0bcf83f0ed5c028875dc16cddf58b92 Merge: 898ba0e 1a9f2e6 Author: Sebastian Nagel AuthorDate: Wed Nov 14 10:10:31

[nutch] branch master updated (898ba0e -> f443f1b)

2018-11-14 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 898ba0e Merge pull request #402 from jorgelbg/index-links-schema add 1a9f2e6 NUTCH-2655 Update Solr schema.xml

[nutch] branch master updated (f443f1b -> 8151237)

2018-11-14 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from f443f1b Merge pull request #395 from sebastian-nagel/NUTCH-2655-solr-schema-7x add 54f156c NUTCH-2630 Fetcher

[nutch] 01/01: Merge pull request #387 from sebastian-nagel/NUTCH-2630-fetcher-log-robotstxt-denied

2018-11-14 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 8151237a4000972f79ee30b371cc7be1dbb10d04 Merge: f443f1b 54f156c Author: Sebastian Nagel AuthorDate: Wed Nov 14 13:04:49

[nutch] 12/14: NUTCH-2671 Upgrade to ant ivy library - fix order of ant target dependencies: "compile-core" must come before "resolve-test"

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 393d3e5f96c0f381b904e17e5abcad695f911e5e Author: Sebastian Nagel AuthorDate: Tue Oct 30 16:45:22 2018 +0100 NUTCH

[nutch] 03/14: NUTCH-2651 Upgrade core and parse-tika to use Tika 1.19.1 - add work-around to fix downloading of dependency javax.ws.rs-api-*.jar (need to set property packaging.type=jar)

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 2a3b1d15fdebe7ada325b9b955c164270a21e127 Author: Sebastian Nagel AuthorDate: Fri Oct 12 13:47:43 2018 +0200 NUTCH

[nutch] 13/14: NUTCH-2671 Upgrade to ant ivy library - roll back to 2.4.0 to bring Jenkins build back to normal

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit e6a961ce967e94dc7128154b68cfa24fcd4370e9 Author: Sebastian Nagel AuthorDate: Tue Oct 30 17:47:22 2018 +0100 NUTCH

[nutch] 05/14: NUTCH-2655 Update Solr schema.xml for Solr 7.x - add required field types to schema.xml

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit a9ea1f1012f6d1b4296d4728b00cf7498aa05dba Author: Sebastian Nagel AuthorDate: Mon Oct 15 15:04:01 2018 +0200 NUTCH

[nutch] 06/14: NUTCH-2659 Add missing Apache license headers

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 48e1aef83b94468c9f839cf28b24560bef233780 Author: Sebastian Nagel AuthorDate: Wed Oct 17 14:23:44 2018 +0200 NUTCH

[nutch] branch master updated (8151237 -> f861c82)

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 8151237 Merge pull request #387 from sebastian-nagel/NUTCH-2630-fetcher-log-robotstxt-denied add 8b7298d

[nutch] 10/14: NUTCH-2658 Adding the fields required by the index-links plugin to the schema

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit a5df63a3d644e90fb881a0f16c8f29d9320d1de3 Author: Jorge Luis Betancourt AuthorDate: Tue Oct 23 22:57:03 2018 +0200

[nutch] 07/14: NUTCH-2660 Plugin tests not executed - add missing unit test packages to plugin build.xml - tests of "headings" plugin depend on "lib-nekohtml" - add "protocol-okhttp" to Javadoc API ov

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit d45fb7a659ba29371f171817a6a6de72965189c3 Author: Sebastian Nagel AuthorDate: Wed Oct 17 14:36:58 2018 +0200 NUTCH

[nutch] 08/14: NUTCH-2661 Move the TestOutlinks class into the o.a.n.parse path

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 2d48152db0d032a58ea2324e8b40b6c5c48d7cd6 Author: Jorge Luis Betancourt Gonzalez AuthorDate: Wed Oct 17 18:07:51 2018

[nutch] 11/14: NUTCH-2671 Upgrade to ant ivy library - upgrade to 2.5.0-rc1 to address NUTCH-2669

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 93b1a8174254de83232be12ac18d99ca4fa83518 Author: Sebastian Nagel AuthorDate: Mon Oct 29 13:41:42 2018 +0100 NUTCH

[nutch] 04/14: NUTCH-2652 Fetcher launches more fetch tasks than fetch lists - properly override method getSplits(...) of FileInputFormat

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 89b16ce29f3bf6618ec2bf9df0807b24c1e40339 Author: Sebastian Nagel AuthorDate: Mon Oct 15 13:44:20 2018 +0200 NUTCH

[nutch] 02/14: NUTCH-2630 Fetcher to log skipped records by robots.txt - change required log level to INFO (default) for messages reporting skipped URLs because of robots.txt rules (disallow or crawl

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 524a59480a3e258a0363faf343fa57875f8f9ea8 Author: Sebastian Nagel AuthorDate: Mon Oct 8 14:50:51 2018 +0200 NUTCH

[nutch] 14/14: NUTCH-1842: crawl.gen.delay value is read incorrectly from config Merge pull request #393 from YossiTamari/patch-2

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit f861c8203c8544b91e061964441485bd2f6de145 Merge: 8151237 e6a961c Author: Sebastian Nagel AuthorDate: Thu Nov 15 11:17:37

[nutch] 09/14: NUTCH-2651 Upgrade to Tika 1.19.1 (from 1.18) - modified work-around to fix downloading of dependency javax.ws.rs-api-*.jar: define property packaging.type in ivysettings.xml

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 31a1ec4bab4a702fa8876926d54b212cc40acbce Author: Sebastian Nagel AuthorDate: Sun Oct 21 20:49:51 2018 +0200 NUTCH

[nutch] 01/14: NUTCH-2625 ProtocolFactory.getProtocol(url) may create multiple plugin instances - lock critical block (conditional creation of plugin instance) on object cache object

2018-11-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit a6f533dfecd688a6c43212b0e826be9a2da5b4ce Author: Sebastian Nagel AuthorDate: Tue Jul 24 16:19:04 2018 +0200 NUTCH

[nutch] branch master updated (f861c82 -> ee9ff89)

2018-11-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from f861c82 NUTCH-1842: crawl.gen.delay value is read incorrectly from config Merge pull request #393 from YossiTamari

[nutch] 01/01: Merge pull request #392 from sebastian-nagel/NUTCH-2606-mime-detection-plain-text

2018-11-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit ee9ff89042acc0f9f13987ed3f303d485d82db4e Merge: f861c82 5f53fd4 Author: Sebastian Nagel AuthorDate: Mon Nov 19 21:52:24

[nutch] branch master updated: NUTCH-1842: crawl.gen.delay value is read incorrectly from config - add warning to CHANGES.txt

2018-11-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new a37bde1 NUTCH-1842: crawl.gen.delay value is

[nutch] branch master updated (a37bde1 -> 785a52f)

2018-11-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from a37bde1 NUTCH-1842: crawl.gen.delay value is read incorrectly from config - add warning to CHANGES.txt add

[nutch] 01/01: Merge pull request #401 from sebastian-nagel/dependency-check

2018-11-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 785a52f897cab00711417be8fd002b32f8b2c93e Merge: a37bde1 3e9a6e4 Author: Sebastian Nagel AuthorDate: Mon Nov 19 21:57:25

[nutch] branch 2.x updated (855e650 -> 5013b9e)

2018-11-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git. from 855e650 NUTCH-2671 Upgrade to ant ivy library - roll back to 2.4.0 to bring Jenkins build back to normal add

[nutch] 01/01: Merge pull request #404 from sebastian-nagel/dependency-check-2x

2018-11-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git commit 5013b9e56cc128d10b31e32e771bd7b0c4aec9b2 Merge: 855e650 f88d73d Author: Sebastian Nagel AuthorDate: Mon Nov 19 21:58:05

[nutch] branch master updated: NUTCH-2668 Integrate OWASP dependency checks as ant target - relax ant build if the OWASP dependency check tool is not installed

2018-11-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new a965cd2 NUTCH-2668 Integrate OWASP dependency

[nutch] branch 2.x updated: NUTCH-2668 Integrate OWASP dependency checks as ant target - relax ant build if the OWASP dependency check tool is not installed

2018-11-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/2.x by this push: new 6adca89 NUTCH-2668 Integrate OWASP dependency checks

[nutch] 01/01: Merge pull request #407 from sebastian-nagel/NUTCH-2674-hostdb-dump-header

2018-12-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 43d26ceceab274e06d607215b06aacf20ff89287 Merge: a965cd2 230d1a2 Author: Sebastian Nagel AuthorDate: Fri Dec 7 17:56:40

[nutch] branch master updated (a965cd2 -> 43d26ce)

2018-12-07 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from a965cd2 NUTCH-2668 Integrate OWASP dependency checks as ant target - relax ant build if the OWASP dependency check

[nutch] 01/01: Merge pull request #423 from sebastian-nagel/NUTCH-2667-upgrade-tika-commons-collections4

2019-01-04 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git commit add07fa68129039fb277114a8af016404ee78873 Merge: 6adca89 7044365 Author: Sebastian Nagel AuthorDate: Fri Jan 4 09:48:56 2019

[nutch] branch 2.x updated (6adca89 -> add07fa)

2019-01-04 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git. from 6adca89 NUTCH-2668 Integrate OWASP dependency checks as ant target - relax ant build if the OWASP dependency check

[nutch] branch 2.x updated: NUTCH-2667 Update Tika and Commons Collections 4 - explicitly add dependency to commons-compress 1.18 for tika-core

2019-01-04 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/2.x by this push: new 90dede9 NUTCH-2667 Update Tika and Commons

[nutch] branch master updated (43d26ce -> 3ab0227)

2019-01-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 43d26ce Merge pull request #407 from sebastian-nagel/NUTCH-2674-hostdb-dump-header add f79a5af NUTCH-2658 Add

[nutch] 01/01: Merge pull request #398 from jorgelbg/doc-indexer-links

2019-01-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 3ab0227a65308312e4d73a49cd2ad08c3771b437 Merge: 43d26ce f79a5af Author: Sebastian Nagel AuthorDate: Sun Jan 6 12:17:25

[nutch] 01/01: Merge pull request #422 from sebastian-nagel/NUTCH-2657-http-headers-crlf

2019-01-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 58ef2dac72ce7305e47eb87db1ee0b327373b36f Merge: 3ab0227 122648e Author: Sebastian Nagel AuthorDate: Sun Jan 6 12:52:45

[nutch] branch master updated (3ab0227 -> 58ef2da)

2019-01-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 3ab0227 Merge pull request #398 from jorgelbg/doc-indexer-links add 122648e NUTCH-2657 Protocol-http to store

[nutch] 01/01: Merge pull request #371 from sebastian-nagel/NUTCH-2628-fetcher-signature

2019-01-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 6274083b7ddc00df88e90237a1416b6fac8839bd Merge: 58ef2da 2ae86d3 Author: Sebastian Nagel AuthorDate: Sun Jan 6 20:40:29

[nutch] branch master updated (58ef2da -> 6274083)

2019-01-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 58ef2da Merge pull request #422 from sebastian-nagel/NUTCH-2657-http-headers-crlf add 2ae86d3 NUTCH-2628

[nutch] branch master updated: NUTCH-2475 If and else-if branches has the same condition - remove duplicated condition to handle ftp status 451 (requested action aborted)

2019-01-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new d0a4abf NUTCH-2475 If and else-if branches has

[nutch] branch 2.x updated: NUTCH-2475 If and else-if branches has the same condition - remove duplicated condition to handle ftp status 451 (requested action aborted)

2019-01-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch 2.x in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/2.x by this push: new e56f82d NUTCH-2475 If and else-if branches has the

[nutch] branch master updated: NUTCH-2663 Improve the JEXL syntax for getting values from the metadata/context

2019-01-18 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new b2ec5c4 NUTCH-2663 Improve the JEXL syntax for

[nutch] branch master updated: NUTCH-2680 Documentation: https supported by multiple protocol plugins not only httpclient Improve description of property plugin.includes: - https is supported by defau

2019-01-18 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 9ae7a80 NUTCH-2680 Documentation: https

[nutch] branch master updated: NUTCH-2682 Upgrade to Tika 1.20 - upgrade to Tika dependencies to version 1.20 - plugin parse-tika: add exclusions of transitive dependencies already provided as Nutch c

2019-01-21 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 784aa5f NUTCH-2682 Upgrade to Tika 1.20

[nutch] branch master updated: NUTCH-2685: README.md file for exchange-jexl plugin.

2019-01-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 636f576 NUTCH-2685: README.md file for exchange

[nutch] branch master updated: NUTCH-2691: Improve logging from scoring-depth plugin

2019-01-29 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 010c2fc NUTCH-2691: Improve logging from

[nutch] branch master updated: NUTCH-2689 Speed up urlfilter-regex and urlfilter-automaton - do not extract host and domain name from the URL if not needed - speed up regular expressions: - use non-ca

2019-01-29 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new f87b19b NUTCH-2689 Speed up urlfilter-regex and

[nutch] branch master updated (33922fe -> fd31cea)

2019-02-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 33922fe NUTCH-2694 HostDB to aggregate by long instead of integer new 3abe7db NUTCH-2695: fix some alerts

[nutch] branch master updated: NUTCH-2627 Fetcher to optionally filter URLs - filter and normalize URLs in QueueFeeder if fetcher.filter.urls resp. fetcher.normalize.urls are true (default is false, i

2019-02-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 546237d NUTCH-2627 Fetcher to optionally filter

[nutch] branch master updated: NUTCH-2693 Misspelled configuration property names in documentation - fix wrong names of Nutch configuration properties in documentation (nutch-default.xml and Java comm

2019-02-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 4787d40 NUTCH-2693 Misspelled configuration

[nutch] branch master updated (e95c915 -> 78af89f)

2019-02-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from e95c915 Merge pull request #437 from sebastian-nagel/NUTCH-2693-misspelled-properties new a326284 NUTCH-2684

[nutch] branch master updated: NUTCH-2676 Update to the latest selenium and add code to use chrome and firefox headless mode with the remote web driver NUTCH-2460 use the headless option of firefox an

2019-02-23 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 8f421a4 NUTCH-2676 Update to the latest

[nutch] 01/01: Revert "NUTCH-2697: Upgrade Ivy to fix the issue of an unset packaging.type property. (#441)"

2019-03-02 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch revert-441-NUTCH-2697 in repository https://gitbox.apache.org/repos/asf/nutch.git commit 5fc56b62d866af00befcae022718c9e9f879eec3 Author: Sebastian Nagel AuthorDate: Sat Mar 2 15:35:38 2019

[nutch] branch revert-441-NUTCH-2697 created (now 5fc56b6)

2019-03-02 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch revert-441-NUTCH-2697 in repository https://gitbox.apache.org/repos/asf/nutch.git. at 5fc56b6 Revert "NUTCH-2697: Upgrade Ivy to fix the issue of an unset packaging.type property.

[nutch] 01/01: Merge pull request #442 from apache/revert-441-NUTCH-2697

2019-03-02 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 2ca3d89d361e1db94b174804393e259fc389e84e Merge: 0b0fcea 5fc56b6 Author: Sebastian Nagel AuthorDate: Sat Mar 2 15:45:32

[nutch] branch master updated (0b0fcea -> 2ca3d89)

2019-03-02 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. from 0b0fcea NUTCH-2697: Upgrade Ivy to fix the issue of an unset packaging.type property. (#441) add 5fc56b6

[nutch] branch master updated: NUTCH-2683 DeduplicationJob: add option to prefer https:// over http:// - add optional value "httpsOverHttp" to -compareOrder argument to prefer https:// over http:// if

2019-04-10 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 3958d0c NUTCH-2683 DeduplicationJob: add option

<    1   2   3   4   5   6   7   8   9   10   >