(nutch) branch master updated: NUTCH-3055 README: fix Github "hub" commands - replace "git" with "hub" were necessary - improve formatting of "contributing" steps

2024-05-28 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new ca03d9b76 NUTCH-3055 README: fix Github &quo

(nutch) branch master updated (8abc78a65 -> bfa07df29)

2024-05-28 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from 8abc78a65 NUTCH-3041 Address confusing logging in o.a.n.net.URLExemptionFilters (#813) add 4b263533a NUTCH-3044

(nutch) 01/01: Merge pull request #815 from sebastian-nagel/NUTCH-3044-generator-npe

2024-05-28 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit bfa07df29f7b810365620abff06680eac9bcddf9 Merge: 8abc78a65 b153279ad Author: Sebastian Nagel AuthorDate: Tue May 28 13:55

(nutch) branch master updated: NUTCH-3043 Generator: count URLs rejected by URL filters (#814)

2024-05-14 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 5f1330a03 NUTCH-3043 Generator: count URLs

(nutch) branch master updated: NUTCH-3039 Failure to handle ftp:// URLs

2024-05-14 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new ea9c7ee5d NUTCH-3039 Failure to handle ftp

(nutch-site) branch asf-site updated: Revert incorrect change in doap.rdf (see #2)

2024-05-11 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-site by this push: new 8456fb5 Revert incorrect change

(nutch-site) branch asf-staging updated: Revert incorrect change in doap.rdf (see #2)

2024-05-11 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-staging in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-staging by this push: new d7ac03a Revert incorrect change

(nutch-site) branch main updated: Revert incorrect change (#2)

2024-05-11 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/main by this push: new c011a7e Revert incorrect change (#2) c011a7e

(nutch) branch master updated: NUTCH-3008 indexer-elastic: downgrade to ES 7.10.2 to address licensing issues

2024-03-14 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 367988dfd NUTCH-3008 indexer-elastic: downgrade

(nutch) branch master updated: Update crawl documentation

2024-03-10 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 83acd501e Update crawl documentation 83acd501e

(nutch) branch master updated (adadc43fb -> 7ad382d95)

2023-11-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from adadc43fb Merge branch 'NUTCH-3017', closes #793 new d8e66ce87 [NUTCH-3025^Curlfilter-fast to filter based

(nutch) 02/02: Merge branch 'NUTCH-3017', closes #793

2023-11-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit adadc43fb169793c47ab25a0eba99a5f20eda763 Merge: 90849124d ac383fc51 Author: Sebastian Nagel AuthorDate: Wed Nov 8 13:35

(nutch) 01/02: [NUTCH-3017] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input - use Hadoop-provided compression codecs - update description of property urlfilter.fast.file

2023-11-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit ac383fc5125b6c114a23ef996558ead57e873970 Author: Sebastian Nagel AuthorDate: Wed Nov 8 12:24:24 2023 +0100 [NUTCH

(nutch) branch master updated (90849124d -> adadc43fb)

2023-11-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from 90849124d NUTCH-3020 -- ParseSegment should check for okhttp's truncation flag (#794) add d1025fd63 [NUTCH-3017

[nutch] branch master updated: NUTCH-3012 SegmentReader when dumping with option -recode: NPE on unparsed documents - fall back to UTF-8 when stringifying the content of unparsed documents

2023-10-21 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new d2c3e96d8 NUTCH-3012 SegmentReader when dumping

[nutch] branch master updated: NUTCH-3011 HttpRobotRulesParser: handle HTTP 429 Too Many Requests same as server errors (HTTP 5xx)

2023-10-21 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new b081c75d8 NUTCH-3011 HttpRobotRulesParser

[nutch] branch master updated: NUTCH-2990 HttpRobotRulesParser to follow 5 redirects as specified by RFC 9309 (#779)

2023-10-21 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new ecdd19dbd NUTCH-2990 HttpRobotRulesParser

[nutch] branch master updated: NUTCH-3009 Upgrade to Hadoop 3.3.6

2023-10-21 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new bb68385f9 NUTCH-3009 Upgrade to Hadoop 3.3.6

[nutch] branch master updated: NUTCH-3002 Protocol-okhttp HttpResponse: HTTP header metadata lookup should be case-insensitive - implement class CaseInsensitiveMetadata providing case-insensitive me

2023-10-21 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new e96cfc56e NUTCH-3002 Protocol-okhttp

[nutch] branch master updated (a1ab4333e -> a74b57b90)

2023-10-03 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from a1ab4333e NUTCH-2897 Do not supress deprecated API warnings - deprecate constructor of NutchJob - remove deprocated

[nutch] branch master updated: NUTCH-2897 Do not supress deprecated API warnings - deprecate constructor of NutchJob - remove deprocated call to Object.finalize() from Plugin.finalize()

2023-10-03 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new a1ab4333e NUTCH-2897 Do not supress deprecated

[nutch] branch master updated: NUTCH-3010 Injector: count unique number of injected URLs - add counter urls_injected_unique - improve log messages reporting the counts of injected/merged URLs

2023-10-02 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 810b1d6ad NUTCH-3010 Injector: count unique

[nutch] branch master updated (417b87732 -> a72a53a32)

2023-09-30 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from 417b87732 NUTCH-2852 SpotBugs: Method invokes System.exit(...) - remove all calls of System.exit(...) in methods

[nutch] branch master updated: NUTCH-2852 SpotBugs: Method invokes System.exit(...) - remove all calls of System.exit(...) in methods except main(args) of various "checker" tools

2023-09-30 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 417b87732 NUTCH-2852 SpotBugs: Method invokes

[nutch] branch master updated: NUTCH-2997 Add Override annotations

2023-08-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 0fae6b59f NUTCH-2997 Add Override annotations

[nutch] branch master updated: NUTCH-2996 Use new SimpleRobotRulesParser API entry point crawler-commons 1.4

2023-08-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 070c115cf NUTCH-2996 Use new

[nutch] branch master updated: NUTCH-2995 Upgrade to crawler-commons 1.4

2023-08-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new a24ec5c5b NUTCH-2995 Upgrade to crawler-commons

[nutch] branch master updated: NUTCH-2993 ScoringDepth plugin to skip depth check based on URL Pattern - apply patch contributed by Markus Jelsma

2023-08-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new eae3c52a8 NUTCH-2993 ScoringDepth plugin to skip

[nutch-site] branch asf-staging updated: Add logo on URL path where requested README.md in source code repository

2023-08-04 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-staging in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-staging by this push: new e1f939c Add logo on URL path

[nutch-site] branch main updated: Add logo on URL path where requested README.md in source code repository

2023-08-04 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/main by this push: new c80dcca Add logo on URL path where requested

[nutch-site] branch asf-site updated: Add logo on URL path where requested README.md in source code repository

2023-08-04 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-site by this push: new 3962502 Add logo on URL path where

[nutch-site] branch asf-site updated: Add link to ASF privacy policies

2023-07-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-site by this push: new c252fd7 Add link to ASF privacy

[nutch-site] branch main updated: Add link to ASF privacy policies

2023-07-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/main by this push: new d0832c1 Add link to ASF privacy policies

[nutch-site] branch asf-staging updated: Add link to ASF privacy policies

2023-07-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-staging in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-staging by this push: new 3ff0ddb Add link to ASF privacy

[nutch-site] 01/03: - add link / banner of Apache conferences or events - rename and move link to ASF

2023-07-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/nutch-site.git commit 7cd1d1cce957346615a0cb1efbfd875932764d70 Author: Sebastian Nagel AuthorDate: Thu Jul 20 10:32:50 2023 +0200

[nutch-site] 03/03: Add new committer / PMC

2023-07-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/nutch-site.git commit db7208f4333d1208516db09b3ac4309d9402881c Author: Sebastian Nagel AuthorDate: Thu Jul 20 10:36:26 2023 +0200 Add

[nutch-site] 02/03: Update copyright year 2022 -> 2023

2023-07-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/nutch-site.git commit 44463bd9e75c654d775d9337989e46e75359ed1a Author: Sebastian Nagel AuthorDate: Thu Jul 20 10:35:46 2023 +0200

[nutch-site] branch asf-site updated: - add new committer / PMC - update copyright year 2022 -> 2023 - add link / banner of Apache conferences or events - rename and move link to ASF

2023-07-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-site by this push: new 773089d - add new committer / PMC

[nutch-site] branch main updated (aa45c17 -> db7208f)

2023-07-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch main in repository https://gitbox.apache.org/repos/asf/nutch-site.git from aa45c17 Announce release of Nutch 1.19 - fix release data in announcement new 7cd1d1c - add link / banner

[nutch-site] branch asf-staging updated: - add new committer / PMC - update copyright year 2022 -> 2023 - add link / banner of Apache conferences or events - rename and move link to ASF

2023-07-20 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-staging in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-staging by this push: new a864887 - add new committer

[nutch] branch master updated: NUTCH-2991 Support HTTP/S Header Authorization for Solr connections (#763)

2023-06-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 9109bdd74 NUTCH-2991 Support HTTP/S Header

[nutch] branch master updated: NUTCH-2992 Fetcher: always block fetch queues when exceptions threshold is reached - if QueueFeeder is still alive, also block queues which are empty right now

2023-05-23 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 98d02e70f NUTCH-2992 Fetcher: always block fetch

[nutch] branch master updated: NUTCH-2596 Upgrade from org.mortbay.jetty to org.eclipse.jetty - upgrade from org.mortbay.jetty 6.1.26 to org.eclipse.jetty 9.4.50 (Hadoop depends on 9.4.43) - remove

2023-03-17 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 215993bc6 NUTCH-2596 Upgrade from

[nutch] branch master updated: NUTCH-2984 Drop test proxy server and benchmark tool

2023-03-17 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new b4cb5c1e3 NUTCH-2984 Drop test proxy server

[nutch] branch master updated: NUTCH-2985 Disable plugin urlfilter-validator by default

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 1999b1e11 NUTCH-2985 Disable plugin urlfilter

[nutch] branch master updated: NUTCH-2983 nutch-default.xml improvements - remove property "hadoop.job.history.user.location", obsolete since Hadoop 0.21.0 - normalize spelling (case) of URL and Crawl

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new c8aecfa5d NUTCH-2983 nutch-default.xml

[nutch] branch master updated: NUTCH-2972 Javadoc build fails using JDK 17 - fix Javadoc issues when building with JDK 17

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new a92878df1 NUTCH-2972 Javadoc build fails using

[nutch] branch master updated: NUTCH-2982 Generator: parameter for URL normalization not passed forward - pass forward params `norm` and `maxNumSegments` - fix typos in Javadoc

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new ef2949691 NUTCH-2982 Generator: parameter

[nutch] 01/07: NUTCH-2920 -- first working attempt at migrating ElasticsearchIndexWriter to OpenSearch

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit ca3824fd98290dd7806752decfab6eb9e3b3b569 Author: tallison AuthorDate: Fri Feb 24 14:48:55 2023 -0500 NUTCH-2920

[nutch] 06/07: fix template to include new key store info. remove unused auth

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit e03cad3f42b9be16f45b2012fc738106894ac332 Author: tallison AuthorDate: Wed Mar 1 15:34:08 2023 -0500 fix template

[nutch] 05/07: NUTCH-2920 -- improve username/pw logic and update README.md

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 71fabb2a87ff81b78997133ab7c790afa4ea6157 Author: tallison AuthorDate: Wed Mar 1 13:48:57 2023 -0500 NUTCH-2920

[nutch] 07/07: Add indexer-opensearch-1x to 4 more targets...feedback from sebastian-nagel

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit e8fd21090c0a1e387ee3b5796b7a3be11cf91293 Author: tballison AuthorDate: Fri Mar 3 14:48:20 2023 -0500 Add indexer

[nutch] branch master updated (383aeca5d -> e8fd21090)

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from 383aeca5d NUTCH-2980: Upgraded Selenium to 4.7.2 + HTMLUnit new ca3824fd9 NUTCH-2920 -- first working attempt

[nutch] 03/07: NUTCH-2920 -- add keystore for 2-way tls; add back in no-tls option with a stern warning and possibly helpful links.

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit f6b17177ad6049b5642d9510cb60fe0a1d3b5f1c Author: tallison AuthorDate: Wed Mar 1 12:16:17 2023 -0500 NUTCH-2920

[nutch] 04/07: NUTCH-2920 -- improve handling for missing trust.store.path in the index-writers.xml

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 5fc2839c447a1b3695b4bcb507d428d32ff27281 Author: tallison AuthorDate: Wed Mar 1 13:28:07 2023 -0500 NUTCH-2920

[nutch] 02/07: NUTCH-2920 -- fix imports

2023-03-06 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 6e149f4954a0b7b21120b8e1467a07a82c60e66e Author: tallison AuthorDate: Fri Feb 24 15:22:16 2023 -0500 NUTCH-2920

[nutch] branch master updated: NUTCH-2980: Upgraded Selenium to 4.7.2 + HTMLUnit

2023-02-18 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 383aeca5d NUTCH-2980: Upgraded Selenium to 4.7.2

[nutch] branch master updated: NUTCH-2974 Ant build fails with "Unparseable date" on certain platforms

2023-02-17 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 541e6936d NUTCH-2974 Ant build fails

[nutch] branch master updated: NUTCH-2634 Some links marked as "nofollow" are followed anyway - fix detection of nofollow in multi-valued rel attributes

2023-01-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new dfdd00f31 NUTCH-2634 Some links marked

[nutch] branch master updated (85f7bcb63 -> ed7b6615b)

2022-09-11 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from 85f7bcb63 Prepare for new development after release of 1.19 - bump version number (-> 1.20-NAPSHOT)

svn commit: r56776 - /release/nutch/1.18/

2022-09-10 Thread snagel
Author: snagel Date: Sat Sep 10 13:19:52 2022 New Revision: 56776 Log: Remove 1.18 after release of 1.19 Removed: release/nutch/1.18/

[nutch] 02/02: Prepare for new development after release of 1.19 - bump version number (-> 1.20-NAPSHOT)

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 85f7bcb63ee801bdfb0b41ca2555160583105ea2 Author: Sebastian Nagel AuthorDate: Thu Sep 8 16:28:27 2022 +0200 Prepare

[nutch] branch master updated (ffe059892 -> 85f7bcb63)

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from ffe059892 NUTCH-2969 Javadoc: Javascript search is not working when built on JDK 11 - pass --no-module-directories

[nutch] 01/02: Nutch 1.19 release - update current year in API docs etc. - update version number - add changes / release notes - update links to Hadoop API docs

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git commit 27cf929b83ba86b896762dd4970e445069e514ae Author: Sebastian Nagel AuthorDate: Mon Aug 22 15:57:41 2022 +0200 Nutch

[nutch-site] 02/02: Announce release of Nutch 1.19 - fix release data in announcement

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/nutch-site.git commit aa45c17bf678c601f4f691dfbdca77380aea5edd Author: Sebastian Nagel AuthorDate: Thu Sep 8 15:25:32 2022 +0200

[nutch-site] branch main updated (4efc5a9 -> aa45c17)

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch main in repository https://gitbox.apache.org/repos/asf/nutch-site.git from 4efc5a9 NUTCH-1999 Add /robots.txt to Nutch site (#1) new 73e90d4 Announce release of Nutch 1.19 new

[nutch-site] branch asf-site updated: Announce release of Nutch 1.19 - fix release data in announcement

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-site by this push: new 956a142 Announce release of Nutch 1.19

[nutch-site] branch asf-staging updated: Announce release of Nutch 1.19 - fix release data in announcement

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-staging in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-staging by this push: new 87176ac Announce release

[nutch-site] branch asf-site updated (a41c7ef -> 314b1b2)

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch asf-site in repository https://gitbox.apache.org/repos/asf/nutch-site.git from a41c7ef Add doap.rdf (lost during CMS migration) new 1e7bf4e - add README for branch asf-site - modify

[nutch-site] 02/03: Update content from Hugo build after adding Kube modified templates

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/nutch-site.git commit 45468fc2c2e83cfe1aef57f437ca02991c0256b3 Author: Sebastian Nagel AuthorDate: Thu Sep 8 10:54:37 2022 +0200

[nutch-site] 01/03: - add README for branch asf-site - modify .asf.yaml to contain only instructions required in branch asf-site

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/nutch-site.git commit 1e7bf4e9e7c2f5450444623847476d1b73d7b773 Author: Sebastian Nagel AuthorDate: Thu Sep 8 14:59:33 2022 +0200

[nutch-site] branch asf-staging updated (3e9e725 -> 2cfe00d)

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch asf-staging in repository https://gitbox.apache.org/repos/asf/nutch-site.git discard 3e9e725 Announce release of Nutch 1.19 new 2cfe00d Announce release of Nutch 1.19 This update added

[nutch-site] branch asf-staging updated: Announce release of Nutch 1.19

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-staging in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-staging by this push: new 3e9e725 Announce release

svn commit: r56738 [1/3] - /release/nutch/1.19/CHANGES.txt

2022-09-08 Thread snagel
Author: snagel Date: Thu Sep 8 12:44:33 2022 New Revision: 56738 Log: Release Apache Nutch 1.19 - add change log Added: release/nutch/1.19/CHANGES.txt (with props)

svn commit: r56738 [3/3] - /release/nutch/1.19/CHANGES.txt

2022-09-08 Thread snagel
Propchange: release/nutch/1.19/CHANGES.txt -- svn:eol-style = native

svn commit: r56738 [2/3] - /release/nutch/1.19/CHANGES.txt

2022-09-08 Thread snagel
nutch +[NUTCH-2220] - Rename db.* options used only by the linkdb to linkdb.* + +Nutch 1.11 Release 03/12/2015 (dd/mm/) +Release Report: http://s.apache.org/nutch11 + +* NUTCH-2176 Clean up of log4j.properties (markus) + +* NUTCH-2107 plugin.xml to validate against plugin.dtd (snagel) + +

[nutch-site] branch main updated: NUTCH-1999 Add /robots.txt to Nutch site (#1)

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/main by this push: new 4efc5a9 NUTCH-1999 Add /robots.txt to Nutch

[nutch-site] branch asf-staging updated: - add README for branch asf-staging - modify .asf.yaml to contain only instructions required in branch asf-staging

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-staging in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-staging by this push: new b649bf6 - add README for branch

[nutch-site] branch NUTCH-1999-nutch-site-robots-txt updated (142489f -> f863c1f)

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch NUTCH-1999-nutch-site-robots-txt in repository https://gitbox.apache.org/repos/asf/nutch-site.git omit 142489f NUTCH-1999 Add /robots.txt to Nutch site add 6e318e6 Add modified Kube

[nutch-site] branch asf-staging updated: Sync .asf.yaml file with main branch

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-staging in repository https://gitbox.apache.org/repos/asf/nutch-site.git The following commit(s) were added to refs/heads/asf-staging by this push: new ee7f0b2 Sync .asf.yaml file

[nutch-site] 01/01: Update content from Hugo build after adding Kube modified templates

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch asf-staging in repository https://gitbox.apache.org/repos/asf/nutch-site.git commit d77dbb51645aa6a0249564730aba051ac5585a2e Author: Sebastian Nagel AuthorDate: Thu Sep 8 10:54:37 2022 +0200

[nutch-site] branch asf-staging created (now d77dbb5)

2022-09-08 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch asf-staging in repository https://gitbox.apache.org/repos/asf/nutch-site.git at d77dbb5 Update content from Hugo build after adding Kube modified templates This branch includes

svn commit: r56686 - /dev/nutch/1.19/ /release/nutch/1.19/

2022-09-06 Thread snagel
Author: snagel Date: Tue Sep 6 08:51:59 2022 New Revision: 56686 Log: Release Apache Nutch 1.19 Added: release/nutch/1.19/ - copied from r56685, dev/nutch/1.19/ Removed: dev/nutch/1.19/

svn commit: r56398 - /dev/nutch/1.19/

2022-08-22 Thread snagel
Author: snagel Date: Mon Aug 22 15:15:43 2022 New Revision: 56398 Log: Stage Apache Nutch 1.19 RC#1 Added: dev/nutch/1.19/ dev/nutch/1.19/apache-nutch-1.19-bin.tar.gz (with props) dev/nutch/1.19/apache-nutch-1.19-bin.tar.gz.asc dev/nutch/1.19/apache-nutch-1.19

[nutch] branch branch-1.19 created (now 63d4f11c0)

2022-08-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch branch-1.19 in repository https://gitbox.apache.org/repos/asf/nutch.git at 63d4f11c0 Nutch 1.19 release - update current year in API docs etc. - update version number - add changes

[nutch] annotated tag release-1.19 updated (63d4f11c0 -> 5d7660ceb)

2022-08-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to annotated tag release-1.19 in repository https://gitbox.apache.org/repos/asf/nutch.git *** WARNING: tag release-1.19 was modified! *** from 63d4f11c0 (commit) to 5d7660ceb (tag) tagging

[nutch] branch master updated: NUTCH-2969 Javadoc: Javascript search is not working when built on JDK 11 - pass --no-module-directories to javadoc target when building on JDK 11 - remove obsolete cond

2022-08-22 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new ffe059892 NUTCH-2969 Javadoc: Javascript search

[nutch] branch master updated (bca5fc0d0 -> 635ef2f3b)

2022-08-21 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from bca5fc0d0 NUTCH-2795 CrawlDbReader: compress CrawlDb dumps if configured - configure CSV and JSON LineRecordWriters

[nutch] branch master updated (bec577d50 -> bca5fc0d0)

2022-08-21 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from bec577d50 NUTCH-2863 Injector to parse command-line flags case-insensitive add bca5fc0d0 NUTCH-2795

[nutch] branch master updated: NUTCH-2962 Update and complete package info of protocol plugins

2022-08-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 6f4c80b7f NUTCH-2962 Update and complete package

[nutch] branch master updated: NUTCH-2930 Protocol-okhttp: implement IP filter (#736)

2022-08-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 7e969eaec NUTCH-2930 Protocol-okhttp: implement

[nutch] branch master updated (c0f723e99 -> 05afebd03)

2022-08-19 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from c0f723e99 NUTCH-2957 indexer-solr / Solr schema.xml - add fall-back field definitions for unknown index fields

[nutch] branch master updated (edebfe49f -> c0f723e99)

2022-08-17 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from edebfe49f NUTCH-2955 indexer-solr: replace deprecated/removed field type solr.LatLonType add c0f723e99 NUTCH

[nutch] branch master updated (a5a630055 -> edebfe49f)

2022-08-17 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from a5a630055 Merge pull request #729 from sebastian-nagel/NUTCH-2947-keep-stateful-fetch-queues add edebfe49f NUTCH

[nutch] branch master updated (82f9530dc -> a5a630055)

2022-08-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from 82f9530dc Merge pull request #697 from sebastian-nagel/NUTCH-2896-okhttp-connection-pool new c862d2409 NUTCH

[nutch] branch master updated (b7b834501 -> 82f9530dc)

2022-08-15 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from b7b834501 NUTCH-2958 Upgrade to crawler-commons 1.3 (#740) new af44bcb6f NUTCH-2896 Protocol-okhttp: make

[nutch] branch master updated (8fc4f17ac -> b7b834501)

2022-08-12 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git from 8fc4f17ac NUTCH-2956 index-geoip: dependency upgrades and improvements - upgrade to geoip2 3.0.1 - exclude transitive

[nutch] branch master updated: NUTCH-2956 index-geoip: dependency upgrades and improvements - upgrade to geoip2 3.0.1 - exclude transitive dependencies (Jackson) provided as Nutch core deps - read als

2022-08-09 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 8fc4f17ac NUTCH-2956 index-geoip: dependency

[nutch] branch master updated: NUTCH-2953 Indexer Elastic to ignore SSL issues - apply patch contributed by Markus Jelsma - fix class imports

2022-08-09 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 01ab00b6c NUTCH-2953 Indexer Elastic to ignore

[nutch] branch master updated: NUTCH-2952 Upgrade core dependencies - Hadoop 3.1.3 -> 3.3.3 - log4j 2.17.0 -> 2.17.2 - and some more

2022-08-09 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new e71841fd0 NUTCH-2952 Upgrade core dependencies

  1   2   3   4   5   6   7   8   9   >