dev
Thread
Date
Earlier messages
Messages by Thread
[jira] [Resolved] (NUTCH-2856) Implement a protocol-smb plugin based on hierynomus/smbj
Hiran Chaudhuri (Jira)
Time for a Nutch release?
lewis john mcgibbney
Re: Time for a Nutch release?
BlackIce
Re: Time for a Nutch release?
Joe Gilvary
[PR] NUTCH-3110 Upgrade to Tika 3.1.0 [nutch]
via GitHub
[jira] [Resolved] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.1.0
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.1.0
Tim Allison (Jira)
[jira] [Created] (NUTCH-3110) Upgrade to Tika 3.1.0
Sebastian Nagel (Jira)
[jira] [Assigned] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3109) Unable to update CrawlDB due to URL normalization
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3109) Unable to update CrawlDB due to URL normalization
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3109) Unable to update CrawlDB due to URL normalization
Markus Jelsma (Jira)
[jira] [Created] (NUTCH-3109) Unable to update CrawlDB due to URL normalization
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
Hudson (Jira)
[PR] NUTCH-3108 Fix SLF4J Class Loader Conflict in language-identifier [nutch]
via GitHub
Re: [PR] NUTCH-3108 Fix SLF4J Class Loader Conflict in language-identifier [nutch]
via GitHub
Re: [PR] NUTCH-3108 Fix SLF4J Class Loader Conflict in language-identifier [nutch]
via GitHub
Re: [PR] NUTCH-3108 Fix SLF4J Class Loader Conflict in language-identifier [nutch]
via GitHub
Re: [PR] NUTCH-3108 Fix SLF4J Class Loader Conflict in language-identifier [nutch]
via GitHub
Re: [PR] NUTCH-3108 Fix SLF4J Class Loader Conflict in language-identifier [nutch]
via GitHub
Re: [PR] NUTCH-3108 Fix SLF4J Class Loader Conflict in language-identifier [nutch]
via GitHub
Re: [PR] NUTCH-3108 Fix SLF4J Class Loader Conflict in language-identifier [nutch]
via GitHub
[jira] [Created] (NUTCH-3108) Fix SLF4J Class Loader Conflict in language-identifier
Maciej Puzianowski (Jira)
[jira] [Commented] (NUTCH-3107) QueryString normalizer to support per-host removal of qstr params
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3107) QueryString normalizer to support per-host removal of qstr params
Markus Jelsma (Jira)
[jira] [Updated] (NUTCH-3107) QueryString normalizer to support per-host removal of qstr params
Markus Jelsma (Jira)
[jira] [Updated] (NUTCH-3107) QueryString normalizer to support per-host removal of qstr params
Markus Jelsma (Jira)
[jira] [Updated] (NUTCH-3107) QueryString normalizer to support per-host removal of qstr params
Markus Jelsma (Jira)
[jira] [Updated] (NUTCH-3107) QueryString normalizer to support per-host removal of qstr params
Markus Jelsma (Jira)
[jira] [Created] (NUTCH-3107) QueryString normalizer to support per-host removal of qstr params
Markus Jelsma (Jira)
[jira] [Comment Edited] (NUTCH-3103) Improper fetch interval given as example
Martin Djukanovic (Jira)
[PR] [NUTCH-3103] Fixed custom max intervals for AdaptiveFetchSchedule [nutch]
via GitHub
[jira] [Commented] (NUTCH-3103) Improper fetch interval given as example
Martin Djukanovic (Jira)
[jira] [Commented] (NUTCH-3103) Improper fetch interval given as example
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3103) Improper fetch interval given as example
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3106) Issue with SSLHandshakeException in v1.20 using protocol-http plugin
ASF GitHub Bot (Jira)
[PR] [NUTCH-3106] fix Issue with SSLHandshakeException [nutch]
via GitHub
[jira] [Created] (NUTCH-3106) Issue with SSLHandshakeException in v1.20 using protocol-http plugin
hanbing (Jira)
[jira] [Updated] (NUTCH-3105) WARC exporter does not support Jexl expression if parseData is not loaded
Markus Jelsma (Jira)
[jira] [Updated] (NUTCH-3105) WARC exporter does not support Jexl expression if parseData is not loaded
Markus Jelsma (Jira)
[jira] [Created] (NUTCH-3105) WARC exporter does not support Jexl expression if parseData is not loaded
Markus Jelsma (Jira)
[jira] [Created] (NUTCH-3104) [SECURITY] please replace use of org.codehaus.jackson
PJ Fanning (Jira)
[jira] [Assigned] (NUTCH-3103) Improper fetch interval given as example
Markus Jelsma (Jira)
[jira] [Created] (NUTCH-3103) Improper fetch interval given as example
Isabelle Giguere (Jira)
clone repos with nutch
anon anon
Re: clone repos with nutch
Sebastian Nagel
Re: clone repos with nutch
anon anon
Re: clone repos with nutch
anon anon
[jira] [Resolved] (NUTCH-3100) HostDB to support minimum records per host
Markus Jelsma (Jira)
[jira] [Resolved] (NUTCH-3101) LinkDb's Inlink class to support metadata
Markus Jelsma (Jira)
make more documentation tutorial
anon anon
Fwd: I may fork nutch. Is it a good plan?
anon anon
[jira] [Updated] (NUTCH-3099) Allow wildcard '*' in http.proxy.exception.list
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3102) CrawlDbReader -stats fails with Cannot add NaN to t-digest
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3102) CrawlDbReader -stats fails with Cannot add NaN to t-digest
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3102) CrawlDbReader -stats fails with Cannot add NaN to t-digest
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3102) CrawlDbReader -stats fails with Cannot add NaN to t-digest
Marcos Gomez (Jira)
[jira] [Commented] (NUTCH-3102) CrawlDbReader -stats fails with Cannot add NaN to t-digest
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3102) CrawlDbReader -stats fails with Cannot add NaN to t-digest
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3102) CrawlDbReader -stats fails with Cannot add NaN to t-digest
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3101) LinkDb's Inlink class to support metadata
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3101) LinkDb's Inlink class to support metadata
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3101) LinkDb's Inlink class to support metadata
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3101) LinkDb's Inlink class to support metadata
Hudson (Jira)
[jira] [Resolved] (NUTCH-3072) Fetcher to stop QueueFeeder if aborting with "hung threads"
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3086) Consolidate plugin extension names and IDs
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3097) Plugin indexer-elastic throws ClassNotFoundException due to invalid dependencies
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3097) Plugin indexer-elastic throws ClassNotFoundException due to invalid dependencies
Sebastian Nagel (Jira)
Re: [PR] NUTCH-3097 fixed dependencies for indexer-elastic [nutch]
via GitHub
Re: [PR] NUTCH-3097 fixed dependencies for indexer-elastic [nutch]
via GitHub
[jira] [Created] (NUTCH-3102) CrawlDbReader -stats fails with Cannot add NaN to t-digest
Marcos Gomez (Jira)
[jira] [Updated] (NUTCH-3101) LinkDb's Inlink class to support metadata
Markus Jelsma (Jira)
[jira] [Updated] (NUTCH-3101) LinkDb's Inlink class to support metadata
Markus Jelsma (Jira)
[jira] [Created] (NUTCH-3101) LinkDb's Inlink class to support metadata
Markus Jelsma (Jira)
[PR] Main [nutch]
via GitHub
Re: [PR] Main [nutch]
via GitHub
[jira] [Commented] (NUTCH-3100) HostDB to support minimum records per host
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3100) HostDB to support minimum records per host
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3100) HostDB to support minimum records per host
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3100) HostDB to support minimum records per host
Hudson (Jira)
[jira] [Created] (NUTCH-3100) HostDB to support minimum records per host
Markus Jelsma (Jira)
[jira] [Updated] (NUTCH-3100) HostDB to support minimum records per host
Markus Jelsma (Jira)
[jira] [Created] (NUTCH-3099) Allow wildcard '*' in http.proxy.exception.list
Isabelle Giguere (Jira)
[jira] [Updated] (NUTCH-3098) Docker Setup from Readme does not work
Adrian Kunz (Jira)
[jira] [Created] (NUTCH-3098) Docker Setup from Readme does not work
Adrian Kunz (Jira)
[PR] NUTCH-3087 BasicURLNormalizer to keep userinfo for protocols which mi…ght require it [nutch]
via GitHub
Re: [PR] NUTCH-3087 BasicURLNormalizer to keep userinfo for protocols which might require it [nutch]
via GitHub
[jira] [Resolved] (NUTCH-3079) Dumping a segment fails unless it has been fetched and parsed
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3083) Add RobotRulesParser to bin/nutch
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3097) Plugin indexer-elastic throws ClassNotFoundException due to invalid dependencies
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3097) Plugin indexer-elastic throws ClassNotFoundException due to invalid dependencies
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3097) Plugin indexer-elastic throws ClassNotFoundException due to invalid dependencies
Maciej Puzianowski (Jira)
[jira] [Commented] (NUTCH-3097) Plugin indexer-elastic throws ClassNotFoundException due to invalid dependencies
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3097) Plugin indexer-elastic throws ClassNotFoundException due to invalid dependencies
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3097) Plugin indexer-elastic throws ClassNotFoundException due to invalid dependencies
Hudson (Jira)
[jira] [Created] (NUTCH-3097) Plugin indexer-elastic throws ClassNotFoundException due to invalid dependencies
Maciej Puzianowski (Jira)
[jira] [Reopened] (NUTCH-3094) Github tests to run if build configuration changes
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3094) Github tests to run if build configuration changes
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3094) Github tests to run if build configuration changes
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3095) Update .gitignore to ignore Hadoop native libraries
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Markus Jelsma (Jira)
[jira] [Commented] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Hudson (Jira)
[jira] [Updated] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Markus Jelsma (Jira)
[jira] [Updated] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Markus Jelsma (Jira)
[jira] [Updated] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Markus Jelsma (Jira)
[jira] [Created] (NUTCH-3096) HostDB ResolverThread can create too many job counters
Markus Jelsma (Jira)
[jira] [Assigned] (NUTCH-3093) Ant target test-plugins to depend on compile-core-test
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3093) Ant target test-plugins to depend on compile-core-test
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3093) Ant target test-plugins to depend on compile-core-test
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3095) Update .gitignore to ignore Hadoop native libraries
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3095) Update .gitignore to ignore Hadoop native libraries
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3095) Update .gitignore to ignore Hadoop native libraries
Hudson (Jira)
[PR] NUTCH-3095 Update .gitignore to ignore Hadoop native libraries [nutch]
via GitHub
Re: [PR] NUTCH-3095 Update .gitignore to ignore Hadoop native libraries [nutch]
via GitHub
[jira] [Created] (NUTCH-3095) Update .gitignore to ignore Hadoop native libraries
Sebastian Nagel (Jira)
[PR] NUTCH-3094 Github tests to run if build configuration changes [nutch]
via GitHub
Re: [PR] NUTCH-3094 Github tests to run if build configuration changes [nutch]
via GitHub
Re: [PR] NUTCH-3094 Github tests to run if build configuration changes [nutch]
via GitHub
[PR] NUTCH-3094 Github tests to run if build configuration changes [nutch]
via GitHub
Re: [PR] NUTCH-3094 Github tests to run if build configuration changes [nutch]
via GitHub
[jira] [Created] (NUTCH-3093) Ant target test-plugins to depend on compile-core-test
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3094) Github tests to run if build configuration changes
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3094) Github tests to run if build configuration changes
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3094) Github tests to run if build configuration changes
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3094) Github tests to run if build configuration changes
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3094) Github tests to run if build configuration changes
Hudson (Jira)
[jira] [Commented] (NUTCH-3094) Github tests to run if build configuration changes
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3094) Github tests to run if build configuration changes
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3094) Github tests to run if build configuration changes
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3094) Github tests to run if build configuration changes
Hudson (Jira)
[jira] [Created] (NUTCH-3094) Github tests to run if build configuration changes
Sebastian Nagel (Jira)
[PR] NUTCH-3093 Ant target test-plugins to depend on compile-core-test [nutch]
via GitHub
Re: [PR] NUTCH-3093 Ant target test-plugins to depend on compile-core-test [nutch]
via GitHub
Re: [PR] NUTCH-3093 Ant target test-plugins to depend on compile-core-test [nutch]
via GitHub
[jira] [Commented] (NUTCH-3092) Replace all imports of commons-lang by commons-lang3
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3092) Replace all imports of commons-lang by commons-lang3
ASF GitHub Bot (Jira)
[PR] NUTCH-3092 Replace all imports of commons-lang by commons-lang3 [nutch]
via GitHub
Re: [PR] NUTCH-3092 Replace all imports of commons-lang by commons-lang3 [nutch]
via GitHub
Re: [PR] NUTCH-3092 Replace all imports of commons-lang by commons-lang3 [nutch]
via GitHub
Re: [PR] NUTCH-3092 Replace all imports of commons-lang by commons-lang3 [nutch]
via GitHub
[jira] [Created] (NUTCH-3092) Replace all imports of commons-lang by commons-lang3
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3091) Allow URL filters to flag an existing URL to delete from index
Marcos Gomez (Jira)
[jira] [Commented] (NUTCH-3091) Allow URL filters to flag an existing URL to delete from index
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3091) Allow URL filters to flag an existing URL to delete from index
Marcos Gomez (Jira)
[jira] [Updated] (NUTCH-3091) Allow URL filters to flag an existing URL to delete from index
Marcos Gomez (Jira)
[jira] [Updated] (NUTCH-3091) Allow URL filters to flag an existing URL to delete from index
Marcos Gomez (Jira)
[jira] [Created] (NUTCH-3091) Allow URL filters to flag an existing URL to delete from index
Marcos Gomez (Jira)
[jira] [Comment Edited] (NUTCH-3089) Review MIME type detection
Hiran Chaudhuri (Jira)
[jira] [Created] (NUTCH-3090) Plugin for MIME type detection
Sebastian Nagel (Jira)
[jira] [Created] (NUTCH-3089) Review MIME type detection
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3089) Review MIME type detection
Tim Allison (Jira)
[jira] [Commented] (NUTCH-3089) Review MIME type detection
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3089) Review MIME type detection
Hiran Chaudhuri (Jira)
[jira] [Commented] (NUTCH-3089) Review MIME type detection
Tim Allison (Jira)
[jira] [Commented] (NUTCH-3089) Review MIME type detection
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3089) Review MIME type detection
Hiran Chaudhuri (Jira)
[jira] [Reopened] (NUTCH-2599) charset detection issue with parse-tika
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-2599) charset detection issue with parse-tika
Sebastian Nagel (Jira)
[jira] [Assigned] (NUTCH-2599) charset detection issue with parse-tika
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-2599) charset detection issue with parse-tika
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3003) Consider integration testing in a Dockerized mini-hadoop cluster via testcontainers?
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-2157) Parent Issue for Addressing Miredot REST API Warnings
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-2157) Parent Issue for Addressing Miredot REST API Warnings
Lewis John McGibbney (Jira)
[jira] [Closed] (NUTCH-2157) Parent Issue for Addressing Miredot REST API Warnings
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-2157) Parent Issue for Addressing Miredot REST API Warnings
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-2925) Secure the Nutch REST API using Apache Shiro
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3088) Parsechecker command does not use protocol plugins
Hiran Chaudhuri (Jira)
[jira] [Commented] (NUTCH-3088) Parsechecker command does not use protocol plugins
Hiran Chaudhuri (Jira)
[jira] [Updated] (NUTCH-3088) Parsechecker command does not use protocol plugins
Hiran Chaudhuri (Jira)
[jira] [Updated] (NUTCH-3088) Parsechecker command does not use plugins
Hiran Chaudhuri (Jira)
[jira] [Updated] (NUTCH-3088) Parsechecker command does not load plugins
Hiran Chaudhuri (Jira)
[jira] [Created] (NUTCH-3088) Parsechecker command does not load plugins
Hiran Chaudhuri (Jira)
[jira] [Closed] (NUTCH-3075) tld plugin makes injector crash
Hiran Chaudhuri (Jira)
[jira] [Closed] (NUTCH-3070) Documentation has outdated links
Hiran Chaudhuri (Jira)
[jira] [Commented] (NUTCH-1086) Rewrite protocol-httpclient
Hiran Chaudhuri (Jira)
[jira] [Comment Edited] (NUTCH-3087) Nutch crawling inconsistent on URLs with userinfo
Hiran Chaudhuri (Jira)
[jira] [Comment Edited] (NUTCH-3087) Nutch crawling inconsistent on URLs with userinfo
Hiran Chaudhuri (Jira)
[jira] [Comment Edited] (NUTCH-3087) Nutch crawling inconsistent on URLs with userinfo
Hiran Chaudhuri (Jira)
[jira] [Comment Edited] (NUTCH-3087) Nutch crawling inconsistent on URLs with userinfo
Hiran Chaudhuri (Jira)
[jira] [Comment Edited] (NUTCH-3087) Nutch crawling inconsistent on URLs with userinfo
Hiran Chaudhuri (Jira)
[jira] [Commented] (NUTCH-3087) Nutch crawling inconsistent on URLs with userinfo
Hiran Chaudhuri (Jira)
[jira] [Commented] (NUTCH-3087) Nutch crawling inconsistent on URLs with userinfo
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3087) Nutch crawling inconsistent on URLs with userinfo
Hiran Chaudhuri (Jira)
Earlier messages