dev
Thread
Date
Earlier messages
Messages by Thread
[jira] [Resolved] (NUTCH-2812) Methods returning array may expose internal representation
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-1942) Remove TopLevelDomain
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-1806) Delegate processing of URL domains to crawler commons
Sebastian Nagel (Jira)
Build failed in Jenkins: Nutch » Nutch-trunk #167
Apache Jenkins Server
[jira] [Resolved] (NUTCH-3058) Fetcher: counter for hung threads
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3059) Generator: selector job does not count reduce output records
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3059) Generator: selector job does not count reduce output records
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3061) URL filters to log name of the rule file rules are read from
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3062) protocol-okhttp: optionally record HTTP and SSL/TLS versions
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3065) Format changelog as Markdown
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3066) Protocol plugin unit tests fail randomly
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
Joe Gilvary (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade com.maxmind.geoip2:geoip2 dependency in geoip-index to v4.2.0
ASF GitHub Bot (Jira)
[PR] WIP NUTCH-3064 Upgrade com.maxmind.geoip2:geoip2 dependency in geoip-index to v4.2.0 [nutch]
via GitHub
[jira] [Commented] (NUTCH-3066) Protocol plugin unit tests fail randomly
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3066) Protocol plugin unit tests fail randomly
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3066) Protocol plugin unit tests fail randomly
Hudson (Jira)
Re: [PR] NUTCH-3066 Protocol plugin unit tests fail randomly [nutch]
via GitHub
Re: [PR] NUTCH-3066 Protocol plugin unit tests fail randomly [nutch]
via GitHub
[jira] [Created] (NUTCH-3067) Improve performance of FetchItemQueues if error state is preserved
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3063) Support for "addBinaryContent" from REST API
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3065) Format changelog as Markdown
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3065) Format changelog as Markdown
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3065) Format changelog as Markdown
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3065) Format changelog as Markdown
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3065) Format changelog as Markdown
Hudson (Jira)
[PR] NUTCH-3065 Format changelog as markdown [nutch]
via GitHub
Re: [PR] NUTCH-3065 Format changelog as markdown [nutch]
via GitHub
Re: [PR] NUTCH-3065 Format changelog as markdown [nutch]
via GitHub
Re: [PR] NUTCH-3065 Format changelog as markdown [nutch]
via GitHub
[jira] [Assigned] (NUTCH-3065) Format changelog as Markdown
Sebastian Nagel (Jira)
[jira] [Created] (NUTCH-3065) Format changelog as Markdown
Sebastian Nagel (Jira)
[jira] [Work started] (NUTCH-3064) Upgrade com.maxmind.geoip2:geoip2 dependency in geoip-index to v4.2.0
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3064) Upgrade com.maxmind.geoip2:geoip2 dependency in geoip-index to v4.2.0
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3060) Javadoc link broken on website
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3063) Support for "addBinaryContent" from REST API
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3063) Support for "addBinaryContent" from REST API
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3063) Support for "addBinaryContent" from REST API
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3063) Support for "addBinaryContent" from REST API
Hudson (Jira)
[jira] [Assigned] (NUTCH-3063) Support for "addBinaryContent" from REST API
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3063) Support for "addBinaryContent" from REST API
Isabelle Giguere (Jira)
[jira] [Updated] (NUTCH-3063) Support for "addBinaryContent" from REST API
Isabelle Giguere (Jira)
[jira] [Updated] (NUTCH-3063) Support for "addBinaryContent" from REST API
Isabelle Giguere (Jira)
[jira] [Updated] (NUTCH-3063) Support for "addBinaryContent" from REST API
Isabelle Giguere (Jira)
New JIRA issue - please assign to me
Isabelle Giguere via dev
[jira] [Created] (NUTCH-3063) Support for "addBinaryContent" from REST API
Isabelle Giguere (Jira)
[jira] [Commented] (NUTCH-3062) protocol-okhttp: optionally record HTTP and SSL/TLS versions
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3062) protocol-okhttp: optionally record HTTP and SSL/TLS versions
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3062) protocol-okhttp: optionally record HTTP and SSL/TLS versions
Hudson (Jira)
[PR] NUTCH-3062 protocol-okhttp: optionally record HTTP and SSL/TLS versions [nutch]
via GitHub
Re: [PR] NUTCH-3062 protocol-okhttp: optionally record HTTP and SSL/TLS versions [nutch]
via GitHub
[jira] [Created] (NUTCH-3062) protocol-okhttp: optionally record HTTP and SSL/TLS versions
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3061) URL filters to log name of the rule file rules are read from
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3061) URL filters to log name of the rule file rules are read from
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3061) URL filters to log name of the rule file rules are read from
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3061) URL filters to log name of the rule file rules are read from
Hudson (Jira)
[PR] NUTCH-3061 URL filters to log name of the rules file [nutch]
via GitHub
Re: [PR] NUTCH-3061 URL filters to log name of the rules file [nutch]
via GitHub
Re: [PR] NUTCH-3061 URL filters to log name of the rules file [nutch]
via GitHub
[jira] [Created] (NUTCH-3061) URL filters to log name of the rule file rules are read from
Sebastian Nagel (Jira)
[jira] [Created] (NUTCH-3060) Javadoc link broken on website
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3060) Javadoc link broken on website
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3060) Javadoc link broken on website
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3056) Injector to support resolving seed URLs
Markus Jelsma (Jira)
[jira] [Created] (NUTCH-3059) Generator: selector job does not count reduce output records
Sebastian Nagel (Jira)
[PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]
via GitHub
Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]
via GitHub
Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]
via GitHub
Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]
via GitHub
Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]
via GitHub
Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]
via GitHub
Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]
via GitHub
[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads
Hudson (Jira)
[jira] [Created] (NUTCH-3058) Fetcher: counter for hung threads
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3055) README: fix Github "hub" commands
Sebastian Nagel (Jira)
Re: [PR] NUTCH-3055 README: fix Github "hub" commands [nutch]
via GitHub
[jira] [Resolved] (NUTCH-3044) Generator: NPE when extracting the host part of a URL fails
Sebastian Nagel (Jira)
[PR] NUTCH-3057 - Fix for index-arbitrary plugin improper retention and us… [nutch]
via GitHub
Re: [PR] NUTCH-3057 - Fix for index-arbitrary plugin improper retention and us… [nutch]
via GitHub
Re: [PR] NUTCH-3057 - Fix for index-arbitrary plugin improper retention and us… [nutch]
via GitHub
Re: [PR] NUTCH-3057 - Fix for index-arbitrary plugin improper retention and us… [nutch]
via GitHub
Re: [PR] NUTCH-3057 - Fix for index-arbitrary plugin improper retention and us… [nutch]
via GitHub
Re: [PR] NUTCH-3057 - Fix for index-arbitrary plugin improper retention and us… [nutch]
via GitHub
Re: [PR] NUTCH-3057 - Fix for index-arbitrary plugin improper retention and us… [nutch]
via GitHub
[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
Joe Gilvary (Jira)
[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
Joe Gilvary (Jira)
[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
Hudson (Jira)
[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
ASF GitHub Bot (Jira)
[jira] [Created] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception
Joe Gilvary (Jira)
[jira] [Updated] (NUTCH-3056) Injector to support resolving seed URLs
Markus Jelsma (Jira)
[jira] [Updated] (NUTCH-3056) Injector to support resolving seed URLs
Markus Jelsma (Jira)
[jira] [Updated] (NUTCH-3056) Injector to support resolving seed URLs
Markus Jelsma (Jira)
[jira] [Created] (NUTCH-3056) Injector to support resolving seed URLs
Markus Jelsma (Jira)
[jira] [Closed] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
Lewis John McGibbney (Jira)
[jira] [Work stopped] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3043) Generator: count URLs rejected by URL filters
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3039) Failure to handle ftp:// URLs
Sebastian Nagel (Jira)
Community over Code EU 2024: The countdown has started!
Ryan Skraba
[PR] Revert incorrect change [nutch-site]
via GitHub
Re: [PR] Revert incorrect change [nutch-site]
via GitHub
Re: [PR] Revert incorrect change [nutch-site]
via GitHub
Re: [PR] Revert incorrect change [nutch-site]
via GitHub
[jira] [Commented] (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed
Joe Gilvary (Jira)
[jira] [Closed] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions
Lewis John McGibbney (Jira)
Re: [PR] NUTCH-3054 Address deprecation of Node16 for all GitHub Actions [nutch]
via GitHub
[jira] [Commented] (NUTCH-3055) README: fix Github "hub" commands
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3055) README: fix Github "hub" commands
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3055) README: fix Github "hub" commands
Hudson (Jira)
[jira] [Created] (NUTCH-3055) README: fix Github "hub" commands
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3045) Upgrade from Java 11 to 17
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions
Hudson (Jira)
[jira] [Updated] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3049) Investigate using Records
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3053) Upgrade build and CI to JDK17
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3052) Investigate using sealed classes
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3051) Investigate using new pattern matching syntax in switch expressions
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3050) Investigate use of the enhanced instanceof operator
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3049) Investigate using Records
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3048) Investigate where/if new string utility methods could be used
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3047) Use multi-line text blocks
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3046) Use compact strings
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-1806) Delegate processing of URL domains to crawler commons
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1806) Delegate processing of URL domains to crawler commons
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-1806) Delegate processing of URL domains to crawler commons
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-1806) Delegate processing of URL domains to crawler commons
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-1806) Delegate processing of URL domains to crawler commons
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1806) Delegate processing of URL domains to crawler commons
Hudson (Jira)
[PR] NUTCH-1806 Delegate processing of URL domains to crawler-common [nutch]
via GitHub
Re: [PR] NUTCH-1806 Delegate processing of URL domains to crawler-commons [nutch]
via GitHub
[jira] [Created] (NUTCH-3046) Use compact strings
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3045) Upgrade from Java 11 to 17
Lewis John McGibbney (Jira)
[ANNOUNCE] Apache Nutch 1.20 Release
lewis john mcgibbney
Re: [PR] NUTCH-3044 Generator: NPE when extracting the host part of a URL fails [nutch]
via GitHub
Re: [PR] NUTCH-3044 Generator: NPE when extracting the host part of a URL fails [nutch]
via GitHub
Re: [PR] NUTCH-3044 Generator: NPE when extracting the host part of a URL fails [nutch]
via GitHub
Re: [PR] NUTCH-3044 Generator: NPE when extracting the host part of a URL fails [nutch]
via GitHub
Re: [PR] NUTCH-3043 Generator: count URLs rejected by URL filters [nutch]
via GitHub
Re: [PR] NUTCH-3043 Generator: count URLs rejected by URL filters [nutch]
via GitHub
Re: [PR] NUTCH-3043 Generator: count URLs rejected by URL filters [nutch]
via GitHub
Re: [PR] NUTCH-3043 Generator: count URLs rejected by URL filters [nutch]
via GitHub
Re: [PR] NUTCH-3043 Generator: count URLs rejected by URL filters [nutch]
via GitHub
[jira] [Commented] (NUTCH-3044) Generator: NPE when extracting the host part of a URL fails
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3044) Generator: NPE when extracting the host part of a URL fails
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3044) Generator: NPE when extracting the host part of a URL fails
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3044) Generator: NPE when extracting the host part of a URL fails
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3044) Generator: NPE when extracting the host part of a URL fails
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3044) Generator: NPE when extracting the host part of a URL fails
Hudson (Jira)
[jira] [Created] (NUTCH-3044) Generator: NPE when extracting the host part of a URL fails
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3043) Generator: count URLs rejected by URL filters
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3043) Generator: count URLs rejected by URL filters
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3043) Generator: count URLs rejected by URL filters
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3043) Generator: count URLs rejected by URL filters
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3043) Generator: count URLs rejected by URL filters
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3043) Generator: count URLs rejected by URL filters
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3043) Generator: count URLs rejected by URL filters
Hudson (Jira)
[jira] [Created] (NUTCH-3043) Generator: count URLs rejected by URL filters
Sebastian Nagel (Jira)
[DISCUSS] Consolidating Nutch Continuous Integration
lewis john mcgibbney
Re: [DISCUSS] Consolidating Nutch Continuous Integration
Lewis John McGibbney
Re: [DISCUSS] Consolidating Nutch Continuous Integration
Sebastian Nagel
Re: [DISCUSS] Consolidating Nutch Continuous Integration
Lewis John McGibbney
Re: [PR] NUTCH-3041 Address confusing logging in o.a.n.net.URLExemptionFilters [nutch]
via GitHub
Re: [PR] NUTCH-3041 Address confusing logging in o.a.n.net.URLExemptionFilters [nutch]
via GitHub
[jira] [Updated] (NUTCH-3042) Use GitHub cache action to improve CI execution time
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3042) Use GitHub cache action to improve CI execution time
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
Hudson (Jira)
[jira] [Updated] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3040) Upgrade to Hadoop 3.4.0
Tim Allison (Jira)
[jira] [Created] (NUTCH-3040) Upgrade to Hadoop 3.4.0
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3039) Failure to handle ftp:// URLs
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3039) Failure to handle ftp:// URLs
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3039) Failure to handle ftp:// URLs
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3039) Failure to handle ftp:// URLs
Hudson (Jira)
[jira] [Assigned] (NUTCH-3039) Failure to handle ftp:// URLs
Sebastian Nagel (Jira)
[PR] NUTCH-3039 Failure to handle ftp:// URLs [nutch]
via GitHub
Re: [PR] NUTCH-3039 Failure to handle ftp:// URLs [nutch]
via GitHub
Earlier messages