dev
Thread
Date
Earlier messages
Later messages
Messages by Thread
Re: [PR] [NUTCH-2834] Update crawl documentation / Fix #557 [nutch]
via GitHub
[jira] [Closed] (NUTCH-3024) Remove flaky 'dependency check' target
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3024) Remove flaky 'dependency check' target
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3026) Allow statusOnly option for IndexingJob
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3026) Allow statusOnly option for IndexingJob
Tim Allison (Jira)
[jira] [Commented] (NUTCH-3026) Allow statusOnly option for IndexingJob
Tim Allison (Jira)
[jira] [Commented] (NUTCH-3026) Allow statusOnly option for IndexingJob
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3026) Allow statusOnly option for IndexingJob
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3026) Allow statusOnly option for IndexingJob
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3026) Allow statusOnly option for IndexingJob
Tim Allison (Jira)
[jira] [Commented] (NUTCH-3026) Allow statusOnly option for IndexingJob
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3026) Allow statusOnly option for IndexingJob
ASF GitHub Bot (Jira)
[PR] NUTCH-3026 -- add statusOnly as an indexing option [nutch]
via GitHub
Re: [PR] NUTCH-3026 -- add statusOnly as an indexing option [nutch]
via GitHub
Re: [PR] NUTCH-3026 -- add statusOnly as an indexing option [nutch]
via GitHub
Re: [PR] NUTCH-3026 -- add statusOnly as an indexing option [nutch]
via GitHub
Re: [PR] NUTCH-3026 -- add statusOnly as an indexing option [nutch]
via GitHub
[jira] [Updated] (NUTCH-3026) Allow statusOnly option for IndexingJob
Tim Allison (Jira)
[jira] [Updated] (NUTCH-3026) Allow statusOnly option for IndexingJob
Tim Allison (Jira)
[jira] [Updated] (NUTCH-3026) Allow statusOnly option for IndexingJob
Tim Allison (Jira)
[jira] [Created] (NUTCH-3026) Allow statusOnly option for IndexingJob
Tim Allison (Jira)
[jira] [Closed] (NUTCH-3007) Fix impossible casts
Lewis John McGibbney (Jira)
[jira] [Closed] (NUTCH-2846) Fix various bugs spotted by NUTCH-2815
Lewis John McGibbney (Jira)
[jira] [Closed] (NUTCH-2852) Method invokes System.exit(...) 9 bugs
Lewis John McGibbney (Jira)
[jira] [Closed] (NUTCH-2819) Move spotbugs "installation" directory to avoid that spotbugs is shipped in Nutch runtime
Lewis John McGibbney (Jira)
[jira] [Closed] (NUTCH-2851) Random object created and used only once
Lewis John McGibbney (Jira)
[jira] [Closed] (NUTCH-2850) Method ignores exceptional return value
Lewis John McGibbney (Jira)
Voice Of Apache (Formerly Feathercast) podcast request
Rich Bowen
[jira] [Commented] (NUTCH-2812) Methods returning array may expose internal representation
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2812) Methods returning array may expose internal representation
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2812) Methods returning array may expose internal representation
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2812) Methods returning array may expose internal representation
Hudson (Jira)
[PR] fix for NUTCH-2812 contributed by GabeHaegele [nutch]
via GitHub
Re: [PR] fix for NUTCH-2812 contributed by GabeHaegele [nutch]
via GitHub
Re: [PR] fix for NUTCH-2812 contributed by GabeHaegele [nutch]
via GitHub
[jira] [Resolved] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
Sebastian Nagel (Jira)
Re: [PR] [NUTCH-3025] urlfilter-fast to filter based on the length of the URL [nutch]
via GitHub
Re: [PR] [NUTCH-3025] urlfilter-fast to filter based on the length of the URL [nutch]
via GitHub
Re: [PR] [NUTCH-3025] urlfilter-fast to filter based on the length of the URL [nutch]
via GitHub
Re: [PR] [NUTCH-3025] urlfilter-fast to filter based on the length of the URL [nutch]
via GitHub
Re: [PR] [NUTCH-3025] urlfilter-fast to filter based on the length of the URL [nutch]
via GitHub
Re: [PR] [NUTCH-3025] urlfilter-fast to filter based on the length of the URL [nutch]
via GitHub
[jira] [Resolved] (NUTCH-3020) ParseSegment should check for protocol's flags for truncation
Tim Allison (Jira)
Build failed in Jenkins: Nutch » Nutch-trunk #139
Apache Jenkins Server
Jenkins build is back to normal : Nutch » Nutch-trunk #140
Apache Jenkins Server
[jira] [Resolved] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
Tim Allison (Jira)
[jira] [Comment Edited] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
Tim Allison (Jira)
[jira] [Comment Edited] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
Tim Allison (Jira)
[PR] NUTCH-3019 -- update Tika to 2.9.1 [nutch]
via GitHub
Re: [PR] NUTCH-3019 -- update Tika to 2.9.1 [nutch]
via GitHub
Re: [PR] NUTCH-3019 -- update Tika to 2.9.1 [nutch]
via GitHub
Re: [PR] NUTCH-3019 -- update Tika to 2.9.1 [nutch]
via GitHub
[jira] [Commented] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL
Hudson (Jira)
[jira] [Created] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL
Julien Nioche (Jira)
[jira] [Commented] (NUTCH-3024) Remove flaky 'dependency check' target
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3024) Remove flaky 'dependency check' target
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3024) Remove flaky 'dependency check' target
Hudson (Jira)
[PR] NUTCH-3024 Remove flaky 'dependency check' target [nutch]
via GitHub
Re: [PR] NUTCH-3024 Remove flaky 'dependency check' target [nutch]
via GitHub
[jira] [Created] (NUTCH-3024) Remove flaky 'dependency check' target
Lewis John McGibbney (Jira)
Removing “dependency-check” target from build.xml
lewis john mcgibbney
[jira] [Created] (NUTCH-3023) Use mikepenz/action-junit-report to improve interpretation of failed tests during CI
Lewis John McGibbney (Jira)
[jira] [Closed] (NUTCH-3014) Standardize Job names
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3014) Standardize Job names
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3022) Experiment formatting codebase per google-java-format
Lewis John McGibbney (Jira)
[jira] [Work stopped] (NUTCH-3014) Standardize Job names
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3020) ParseSegment should check for protocol's flags for truncation
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3020) ParseSegment should check for protocol's flags for truncation
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3020) ParseSegment should check for protocol's flags for truncation
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3020) ParseSegment should check for protocol's flags for truncation
Hudson (Jira)
[PR] NUTCH-3020 -- ParseSegment should check for okhttp's truncation flag [nutch]
via GitHub
Re: [PR] NUTCH-3020 -- ParseSegment should check for okhttp's truncation flag [nutch]
via GitHub
Re: [PR] NUTCH-3020 -- ParseSegment should check for okhttp's truncation flag [nutch]
via GitHub
[jira] [Created] (NUTCH-3021) Improve http-protocol to identify truncated content
Tim Allison (Jira)
[jira] [Created] (NUTCH-3020) ParseSegment should check for protocol's flags for truncation
Tim Allison (Jira)
[jira] [Comment Edited] (NUTCH-3018) Consider pooling remote webdrivers for Selenium?
Tim Allison (Jira)
[jira] [Comment Edited] (NUTCH-3018) Consider pooling remote webdrivers for Selenium?
Tim Allison (Jira)
[jira] [Commented] (NUTCH-3018) Consider pooling remote webdrivers for Selenium?
Tim Allison (Jira)
[jira] [Commented] (NUTCH-3018) Consider pooling remote webdrivers for Selenium?
Tim Allison (Jira)
[jira] [Updated] (NUTCH-3018) Consider pooling remote webdrivers for Selenium?
Tim Allison (Jira)
[jira] [Updated] (NUTCH-3018) Consider pooling remote webdrivers for Selenium?
Tim Allison (Jira)
[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
Tim Allison (Jira)
[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
Tim Allison (Jira)
[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
Hudson (Jira)
[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
Tim Allison (Jira)
[jira] [Created] (NUTCH-3019) Upgrade to Apache Tika 2.9.1
Tim Allison (Jira)
[jira] [Resolved] (NUTCH-2959) Upgrade to Apache Tika 2.9.0
Tim Allison (Jira)
[jira] [Created] (NUTCH-3018) Consider pooling remote webdrivers for Selenium?
Tim Allison (Jira)
Re: [PR] [NUTCH-3017] Allow fast-urlfilter to load from HDFS/S3 [nutch]
via GitHub
Re: [PR] [NUTCH-3017] Allow fast-urlfilter to load from HDFS/S3 [nutch]
via GitHub
Re: [PR] [NUTCH-3017] Allow fast-urlfilter to load from HDFS/S3 [nutch]
via GitHub
Re: [PR] [NUTCH-3017] Allow fast-urlfilter to load from HDFS/S3 [nutch]
via GitHub
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
Hudson (Jira)
[PR] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input [NUTCH-3017] [nutch]
via GitHub
Re: [PR] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input [NUTCH-3017] [nutch]
via GitHub
Re: [PR] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input [NUTCH-3017] [nutch]
via GitHub
[jira] [Updated] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
Sebastian Nagel (Jira)
[jira] [Created] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
Julien Nioche (Jira)
Call for Presentations now open: Community over Code EU 2024
Ryan Skraba
Build failed in Jenkins: Nutch » Nutch-trunk #135
Apache Jenkins Server
Build failed in Jenkins: Nutch » Nutch-trunk #136
Apache Jenkins Server
Build failed in Jenkins: Nutch » Nutch-trunk #137
Apache Jenkins Server
Jenkins build is back to normal : Nutch » Nutch-trunk #138
Apache Jenkins Server
[jira] [Work stopped] (NUTCH-3015) Add more CI steps to GitHub master-build.yml
Lewis John McGibbney (Jira)
[jira] [Closed] (NUTCH-3015) Add more CI steps to GitHub master-build.yml
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3015) Add more CI steps to GitHub master-build.yml
Lewis John McGibbney (Jira)
Re: [PR] NUTCH-2887 Migrate to JUnit 5 Jupiter [nutch]
via GitHub
[jira] [Work started] (NUTCH-2887) Migrate to JUnit 5 Jupiter
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3016) Upgrade Apache Ivy to 2.5.2
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-2887) Migrate to JUnit 5 Jupiter
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3015) Add more CI steps to GitHub master-build.yml
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3014) Standardize Job names
Lewis John McGibbney (Jira)
Nutch codebase formatting
lewis john mcgibbney
Re: Nutch codebase formatting
Lewis John McGibbney
Re: Nutch codebase formatting
Sebastian Nagel
Re: Nutch codebase formatting
Lewis John McGibbney
[jira] [Commented] (NUTCH-3015) Add more CI steps to GitHub master-build.yml
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3015) Add more CI steps to GitHub master-build.yml
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3015) Add more CI steps to GitHub master-build.yml
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3015) Add more CI steps to GitHub master-build.yml
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3015) Add more CI steps to GitHub master-build.yml
Hudson (Jira)
[PR] NUTCH-3015 Add more CI steps to GitHub master-build.yml [nutch]
via GitHub
Re: [PR] NUTCH-3015 Add more CI steps to GitHub master-build.yml [nutch]
via GitHub
Re: [PR] NUTCH-3015 Add more CI steps to GitHub master-build.yml [nutch]
via GitHub
Re: [PR] NUTCH-3015 Add more CI steps to GitHub master-build.yml [nutch]
via GitHub
[jira] [Created] (NUTCH-3015) Add more CI steps to GitHub master-build.yml
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3014) Standardize Job names
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3014) Standardize Job names
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3014) Standardize Job names
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3014) Standardize Job names
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3014) Standardize Job names
Hudson (Jira)
[PR] NUTCH-3014 Standardize Job names [nutch]
via GitHub
Re: [PR] NUTCH-3014 Standardize Job names [nutch]
via GitHub
Re: [PR] NUTCH-3014 Standardize Job names [nutch]
via GitHub
Re: [PR] NUTCH-3014 Standardize Job names [nutch]
via GitHub
[jira] [Updated] (NUTCH-3014) Standardize Job names
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3014) Standardize Job names
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3014) Standardize Job names
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3013) Employ commons-lang3's StopWatch to simplify timing logic
Lewis John McGibbney (Jira)
[jira] [Closed] (NUTCH-3013) Employ commons-lang3's StopWatch to simplify timing logic
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3012) SegmentReader when dumping with option -recode: NPE on unparsed documents
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3011) HttpRobotRulesParser: handle HTTP 429 Too Many Requests same as server errors (HTTP 5xx)
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-2990) HttpRobotRulesParser to follow 5 redirects as specified by RFC 9309
Sebastian Nagel (Jira)
[jira] [Assigned] (NUTCH-3009) Upgrade to Hadoop 3.3.6
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3009) Upgrade to Hadoop 3.3.6
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3006) Downgrade Tika dependency to 2.2.1 (core and parse-tika)
Sebastian Nagel (Jira)
[jira] [Assigned] (NUTCH-3002) Protocol-okhttp HttpResponse: HTTP header metadata lookup should be case-insensitive
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3002) Protocol-okhttp HttpResponse: HTTP header metadata lookup should be case-insensitive
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3014) Standardize NutchJob job names
Sebastian Nagel (Jira)
[jira] [Created] (NUTCH-3014) Standardize NutchJob job names
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3013) Employ commons-lang3's StopWatch to simplify timing logic
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3013) Employ commons-lang3's StopWatch to simplify timing logic
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3013) Employ commons-lang3's StopWatch to simplify timing logic
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3013) Employ commons-lang3's StopWatch to simplify timing logic
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3013) Employ commons-lang3's StopWatch to simplify timing logic
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3013) Employ commons-lang3's StopWatch to simplify timing logic
Hudson (Jira)
[PR] NUTCH-3013 Employ commons-lang3's StopWatch to simplify timing logic [nutch]
via GitHub
Re: [PR] NUTCH-3013 Employ commons-lang3's StopWatch to simplify timing logic [nutch]
via GitHub
Re: [PR] NUTCH-3013 Employ commons-lang3's StopWatch to simplify timing logic [nutch]
via GitHub
Re: [PR] NUTCH-3013 Employ commons-lang3's StopWatch to simplify timing logic [nutch]
via GitHub
[jira] [Created] (NUTCH-3013) Employ commons-lang3's StopWatch to simplify timing logic
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3012) SegmentReader when dumping with option -recode: NPE on unparsed documents
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3012) SegmentReader when dumping with option -recode: NPE on unparsed documents
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3012) SegmentReader when dumping with option -recode: NPE on unparsed documents
Hudson (Jira)
[PR] NUTCH-3012 SegmentReader when dumping with option -recode: NPE on unarsed documents [nutch]
via GitHub
Re: [PR] NUTCH-3012 SegmentReader when dumping with option -recode: NPE on unarsed documents [nutch]
via GitHub
[jira] [Updated] (NUTCH-3012) SegmentReader when dumping with option -recode: NPE on unparsed documents
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3012) SegmentReader when dumping with option -recode: NPE on unparsed documents
Sebastian Nagel (Jira)
[jira] [Created] (NUTCH-3012) SegmentReader when dumping with option -recode: NPE on documents without charset defined
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3002) Protocol-okhttp HttpResponse: HTTP header metadata lookup should be case-insensitive
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3002) Protocol-okhttp HttpResponse: HTTP header metadata lookup should be case-insensitive
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3002) Protocol-okhttp HttpResponse: HTTP header metadata lookup should be case-insensitive
Hudson (Jira)
[jira] [Resolved] (NUTCH-1130) JUnit test for Any23 RDF plugin
Sebastian Nagel (Jira)
[jira] [Closed] (NUTCH-1130) JUnit test for Any23 RDF plugin
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-2938) Use Any23's RepositoryWriter to write structured data to Rdf4j repository
Sebastian Nagel (Jira)
[jira] [Closed] (NUTCH-2938) Use Any23's RepositoryWriter to write structured data to Rdf4j repository
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-2938) Use Any23's RepositoryWriter to write structured data to Rdf4j repository
Sebastian Nagel (Jira)
Earlier messages
Later messages