dev
Thread
Date
Earlier messages
Messages by Thread
[jira] [Updated] (NUTCH-3155) Add ErrorTracker to remaining MapReduce jobs missing error metrics
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3155) Add ErrorTracker to remaining MapReduce jobs missing error metrics
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3155) Missing ErrorTracker in CrawlDbFilter, DeduplicationJob, WebGraph and inconsistent initialization in FetcherThread
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3155) Missing ErrorTracker in CrawlDbFilter, DeduplicationJob, WebGraph and inconsistent initialization in FetcherThread
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
Sebastian Nagel (Jira)
[ANNOUNCE] Apache Nutch 1.22 Release
Sebastian Nagel
[RESULT] was [VOTE] Release Apache Nutch 1.22 RC#1
Sebastian Nagel
[jira] [Commented] (NUTCH-2931) Improvements to 1.x REST API
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2931) Improvements to 1.x REST API
ASF GitHub Bot (Jira)
[PR] NUTCH-2931 Create OpenAPI specification for Nutch 1.x REST API [nutch]
via GitHub
Re: [PR] NUTCH-2931 Create OpenAPI specification for Nutch 1.x REST API [nutch]
via GitHub
[jira] [Assigned] (NUTCH-2932) Create OpenAPI specification for Nutch 1.x REST API
Lewis John McGibbney (Jira)
[PR] NUTCH-3154 Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers [nutch]
via GitHub
Re: [PR] NUTCH-3154 Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers [nutch]
via GitHub
Re: [PR] NUTCH-3154 Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers [nutch]
via GitHub
Re: [PR] NUTCH-3154 Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers [nutch]
via GitHub
[jira] [Commented] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
ASF GitHub Bot (Jira)
[jira] [Work started] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
Lewis John McGibbney (Jira)
[DISCUSS] Future of the Nutch REST API
lewis john mcgibbney
Re: [DISCUSS] Future of the Nutch REST API
Lewis John McGibbney
Re: [DISCUSS] Future of the Nutch REST API
Sebastian Nagel
Re: [DISCUSS] Future of the Nutch REST API
Isabelle Giguere
Re: [DISCUSS] Future of the Nutch REST API
Lewis John McGibbney
Re: [DISCUSS] Future of the Nutch REST API
Lewis John McGibbney
Re: [DISCUSS] Future of the Nutch REST API
BlackIce
Re: [DISCUSS] Future of the Nutch REST API
Joe Gilvary
Re: [DISCUSS] Future of the Nutch REST API
Isabelle Giguere
[VOTE] Release Apache Nutch 1.22 RC#1
Sebastian Nagel
[jira] [Resolved] (NUTCH-3153) Update of license and notice files
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-3003) Consider integration testing in a Dockerized mini-hadoop cluster via testcontainers?
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3153) Update of license and notice files
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3153) Update of license and notice files
ASF GitHub Bot (Jira)
[PR] NUTCH-3153 Update of license and notice files [nutch]
via GitHub
Re: [PR] NUTCH-3153 Update of license and notice files [nutch]
via GitHub
[jira] [Created] (NUTCH-3153) Update of license and notice files
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3120) Automatically increase crawl-delay on HTTP 429
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3127) Deprecate or remove DmozParser
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3152) Job counters getGroup to use metrics constants
Sebastian Nagel (Jira)
Build failed in Jenkins: Nutch » Nutch-trunk #219
Apache Jenkins Server
Jenkins build is back to normal : Nutch » Nutch-trunk #220
Apache Jenkins Server
[jira] [Resolved] (NUTCH-2793) CSV indexer does not work in distributed mode
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3151) Dynamic Counter Management
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3152) Job counters getGroup to use metrics constants
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3152) Job counters getGroup to use metrics constants
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3152) Job counters getGroup to use metrics constants
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3152) Job counters getGroup to use metrics constants
Hudson (Jira)
[PR] NUTCH-3152 Job counters getGroup to use metrics constants [nutch]
via GitHub
Re: [PR] NUTCH-3152 Job counters getGroup to use metrics constants [nutch]
via GitHub
Re: [PR] NUTCH-3152 Job counters getGroup to use metrics constants [nutch]
via GitHub
[jira] [Created] (NUTCH-3152) Job counters getGroup to use metrics constants
Sebastian Nagel (Jira)
Re: [PR] NUTCH-2793 indexer-csv: make it work in distributed mode [nutch]
via GitHub
Re: [PR] NUTCH-2793 indexer-csv: make it work in distributed mode [nutch]
via GitHub
Re: [PR] NUTCH-2793 indexer-csv: make it work in distributed mode [nutch]
via GitHub
Re: [PR] fix for NUTCH-2455 more efficient usage of hostdb in generate [nutch]
via GitHub
Re: [PR] fix for NUTCH-2455 more efficient usage of hostdb in generate [nutch]
via GitHub
[jira] [Created] (NUTCH-3151) Dynamic Counter Management
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
Hudson (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[PR] NUTCH-3150 Expand Caching Hadoop Counter References [nutch]
via GitHub
Re: [PR] NUTCH-3150 Expand Caching Hadoop Counter References [nutch]
via GitHub
Re: [PR] NUTCH-3150 Expand Caching Hadoop Counter References [nutch]
via GitHub
[jira] [Updated] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Work stopped] (NUTCH-3146) Add Resource Utilization Metrics for Fetcher
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-1732) IndexerMapReduce to delete explicitly not indexable documents
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1732) IndexerMapReduce to delete explicitly not indexable documents
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1732) IndexerMapReduce to delete explicitly not indexable documents
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1732) IndexerMapReduce to delete explicitly not indexable documents
ASF GitHub Bot (Jira)
[PR] NUTCH-1732: allow deleting non-parsable documents [nutch]
via GitHub
Re: [PR] NUTCH-1732: allow deleting non-parsable documents [nutch]
via GitHub
Re: [PR] NUTCH-1732: allow deleting non-parsable documents [nutch]
via GitHub
Re: [PR] NUTCH-1732: allow deleting non-parsable documents [nutch]
via GitHub
[jira] [Work started] (NUTCH-3146) Add Resource Utilization Metrics for Fetcher
Lewis John McGibbney (Jira)
[jira] [Work stopped] (NUTCH-3142) Add Error Context to Metrics
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3142) Add Error Context to Metrics
Lewis John McGibbney (Jira)
[DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Isabelle Giguere
Re: [DISCUSS] Migrate to Java 17
BlackIce
Re: [DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Edward Capriolo
Re: [DISCUSS] Migrate to Java 17
Joe Gilvary
Re: [DISCUSS] Migrate to Java 17
Lewis John McGibbney
Re: [DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Lewis John McGibbney
Re: [DISCUSS] Migrate to Java 17
Joe Gilvary
Re: [DISCUSS] Migrate to Java 17
Sebastian Nagel
[jira] [Resolved] (NUTCH-3110) Upgrade to Tika 3.2.3
Sebastian Nagel (Jira)
[jira] [Assigned] (NUTCH-3110) Upgrade to Tika 3.2.3
Sebastian Nagel (Jira)
[discuss] rolling nutch 1.22
lewis john mcgibbney
Re: [discuss] rolling nutch 1.22
BlackIce
Re: [discuss] rolling nutch 1.22
Joe Gilvary
Re: [discuss] rolling nutch 1.22
Sebastian Nagel
Re: [discuss] rolling nutch 1.22
Lewis John McGibbney
Re: [discuss] rolling nutch 1.22
Sebastian Nagel
Re: (nutch) branch master updated: NUTCH-3143 GitHub workflow does not run all unit tests (#889)
Doug Baber via dev
[jira] [Resolved] (NUTCH-3143) GitHub workflow does not run all unit tests
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3143) GitHub workflow does not run all unit tests
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-3034) Evolve the legacy Nutch plugin framework to use PF4J
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3034) Evolve the legacy Nutch plugin framework to use PF4J
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3149) Investigate Remote Shuffle Service Integration
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3149) Investigate Remote Shuffle Service Integration (Apache Uniffle / Celeborn) for Shuffle-Intensive Nutch Jobs
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
Lewis John McGibbney (Jira)
[PR] NUTCH-2455 Use secondary sorting for memory-efficient HostDb integration in Generator [nutch]
via GitHub
Re: [PR] NUTCH-2455 Use secondary sorting for memory-efficient HostDb integration in Generator [nutch]
via GitHub
Re: [PR] NUTCH-2455 Use secondary sorting for memory-efficient HostDb integration in Generator [nutch]
via GitHub
Re: [PR] NUTCH-2455 Use secondary sorting for memory-efficient HostDb integration in Generator [nutch]
via GitHub
Re: [PR] NUTCH-2455 Use secondary sorting for memory-efficient HostDb integration in Generator [nutch]
via GitHub
Re: [PR] NUTCH-2455 Use secondary sorting for memory-efficient HostDb integration in Generator [nutch]
via GitHub
Re: [PR] NUTCH-2455 Use secondary sorting for memory-efficient HostDb integration in Generator [nutch]
via GitHub
[jira] [Comment Edited] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
ASF GitHub Bot (Jira)
[jira] [Updated] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
Sebastian Nagel (Jira)
Build failed in Jenkins: Nutch » Nutch-trunk #214
Apache Jenkins Server
Jenkins build is back to normal : Nutch » Nutch-trunk #215
Apache Jenkins Server
[jira] [Resolved] (NUTCH-3042) Use GitHub cache action to improve CI execution time
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.2.3
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.2.3
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.2.3
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.2.3
Hudson (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.2.3
Tim Allison (Jira)
[PR] NUTCH-3110 Upgrade to Tika 3.2.3 [nutch]
via GitHub
Re: [PR] NUTCH-3110 Upgrade to Tika 3.2.3 [nutch]
via GitHub
[jira] [Updated] (NUTCH-3110) Upgrade to Tika 3.2.3
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3144) URLUtil unit tests fail after upgrade to crawler-commons 1.6
Sebastian Nagel (Jira)
[jira] [Work started] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
Hudson (Jira)
[PR] NUTCH-3148 Cache Ivy dependencies in GitHub CI builds [nutch]
via GitHub
Re: [PR] NUTCH-3148 Cache Ivy dependencies in GitHub CI builds [nutch]
via GitHub
[jira] [Created] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
Lewis John McGibbney (Jira)
[PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
[PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
[PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
[PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
[jira] [Assigned] (NUTCH-3143) GitHub workflow does not run all unit tests
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3147) Nutch JMX Metrics Evolution with OpenTelemetry
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3145) Upgrade to JUnit 6
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3145) Upgrade to JUnit 6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3145) Upgrade to JUnit 6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3145) Upgrade to JUnit 6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3145) Upgrade to JUnit 6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3145) Upgrade to JUnit 6
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3145) Upgrade to JUnit 6
ASF GitHub Bot (Jira)
[PR] NUTCH-3145 Upgrade to JUnit 6 [nutch]
via GitHub
Re: [PR] NUTCH-3145 Upgrade to JUnit 6 [nutch]
via GitHub
Re: [PR] NUTCH-3145 Upgrade to JUnit 6 [nutch]
via GitHub
Re: [PR] NUTCH-3145 Upgrade to JUnit 6 [nutch]
via GitHub
Re: [PR] NUTCH-3145 Upgrade to JUnit 6 [nutch]
via GitHub
[jira] [Assigned] (NUTCH-3145) Upgrade to JUnit 6
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3146) Add Resource Utilization Metrics for Fetcher
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
Hudson (Jira)
[jira] [Work stopped] (NUTCH-3064) Upgrade com.maxmind.geoip2:geoip2 dependency in geoip-index to v4.2.0
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3142) Add Error Context to Metrics
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3142) Add Error Context to Metrics
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3142) Add Error Context to Metrics
Hudson (Jira)
[PR] NUTCH-3142 Add Error Context to Metrics [nutch]
via GitHub
Re: [PR] NUTCH-3142 Add Error Context to Metrics [nutch]
via GitHub
[jira] [Work started] (NUTCH-3142) Add Error Context to Metrics
Lewis John McGibbney (Jira)
Earlier messages