[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
[ https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17781088#comment-17781088 ]
ASF GitHub Bot commented on NUTCH-3017:
---
jnioche opened a new pull request, #793:
URL: https://github.com/apache/nutch/pull/793
and support gzipped input
See https://issues.apache.org/jira/browse/NUTCH-3017
> Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
> ---
>
> Key: NUTCH-3017
> URL: https://issues.apache.org/jira/browse/NUTCH-3017
> Project: Nutch
> Issue Type: Improvement
> Components: plugin, urlfilter
> Affects Versions: 1.19
> Reporter: Julien Nioche
> Priority: Minor
> Fix For: 1.20
>
> This provides an easier way to refresh the resources, since no rebuild of the
> jar will be needed. The path can point to either HDFS or S3. Additionally,
> .gz files should be handled automatically.
--
This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
[ https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17781080#comment-17781080 ]
ASF GitHub Bot commented on NUTCH-3017:
---
jnioche commented on PR #792:
URL: https://github.com/apache/nutch/pull/792#issuecomment-1785804884
Obviously, pulled more changes than I meant to
Re: [PR] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input [NUTCH-3017] [nutch]
jnioche commented on PR #792:
URL: https://github.com/apache/nutch/pull/792#issuecomment-1785804884
Obviously, pulled more changes than I meant to
--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.
To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org
For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
[ https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17781079#comment-17781079 ]
ASF GitHub Bot commented on NUTCH-3017:
---
jnioche closed pull request #792: Allow fast-urlfilter to load from HDFS/S3 and support gzipped input [NUTCH-3017]
URL: https://github.com/apache/nutch/pull/792
[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
[ https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17781078#comment-17781078 ]
ASF GitHub Bot commented on NUTCH-3017:
---
jnioche opened a new pull request, #792:
URL: https://github.com/apache/nutch/pull/792
See description in https://issues.apache.org/jira/browse/NUTCH-3017
Re: [PR] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input [NUTCH-3017] [nutch]
jnioche closed pull request #792: Allow fast-urlfilter to load from HDFS/S3 and support gzipped input [NUTCH-3017]
URL: https://github.com/apache/nutch/pull/792
[PR] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input [NUTCH-3017] [nutch]
jnioche opened a new pull request, #792:
URL: https://github.com/apache/nutch/pull/792
See description in https://issues.apache.org/jira/browse/NUTCH-3017
[jira] [Updated] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
[ https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel updated NUTCH-3017:
---
Component/s: plugin, urlfilter
[jira] [Updated] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
[ https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel updated NUTCH-3017:
---
Fix Version/s: 1.20
[jira] [Created] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
Julien Nioche created NUTCH-3017:
Summary: Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
Key: NUTCH-3017
URL: https://issues.apache.org/jira/browse/NUTCH-3017
Project: Nutch
Issue Type: Improvement
Affects Versions: 1.19
Reporter: Julien Nioche
This provides an easier way to refresh the resources, since no rebuild of the jar will be needed. The path can point to either HDFS or S3. Additionally, .gz files should be handled automatically.
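The behaviour the issue asks for (read the rules from a path instead of a resource bundled in the jar, and decompress .gz input transparently) could look roughly like the sketch below. It is an illustration only and not the code from the pull request: the class name `RulesReader` and the method `open` are hypothetical, and it reads from a local `java.nio` path to stay self-contained. In Nutch, an HDFS or S3 location would presumably be opened through Hadoop's `FileSystem` API instead; that part is an assumption and is not shown.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.zip.GZIPInputStream;

/** Hypothetical helper for illustration; not part of Nutch. */
final class RulesReader {

  private RulesReader() {}

  /**
   * Opens a rules file as UTF-8 text. If the file name ends in ".gz",
   * the stream is wrapped in a GZIPInputStream so callers always see
   * plain text. Assumption: the extension is what signals compression.
   */
  public static BufferedReader open(Path file) throws IOException {
    InputStream in = Files.newInputStream(file);
    try {
      if (file.getFileName().toString().endsWith(".gz")) {
        in = new GZIPInputStream(in);
      }
    } catch (IOException e) {
      // Do not leak the underlying stream if the gzip header is invalid.
      in.close();
      throw e;
    }
    return new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8));
  }
}
```

Because the caller only ever gets a `BufferedReader`, the rule-parsing code would not need to know whether the source was compressed.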
Call for Presentations now open: Community over Code EU 2024
(Note: You are receiving this because you are subscribed to the dev@ list for one or more projects of the Apache Software Foundation.)

It's back *and* it's new!

We're excited to announce the first edition of Community over Code Europe (formerly known as ApacheCon EU), which will be held at the Radisson Blu Carlton Hotel in Bratislava, Slovakia from June 03-05, 2024! This eagerly anticipated event will be our first live EU conference since 2019.

The Call for Presentations (CFP) for Community over Code EU 2024 is now open at https://eu.communityovercode.org/blog/cfp-open/, and will close 2024/01/12 23:59:59 GMT.

We welcome submissions on any topic related to the Apache Software Foundation, Apache projects, or the communities around those projects. We are specifically looking for presentations in the following categories:

* API & Microservices
* Big Data Compute
* Big Data Storage
* Cassandra
* CloudStack
* Community
* Data Engineering
* Fintech
* Groovy
* Incubator
* IoT
* Performance Engineering
* Search
* Tomcat, Httpd and other servers

Additionally, we are thrilled to introduce a new feature this year: a poster session. This addition will provide an excellent platform for showcasing high-level projects and incubator initiatives in a visually engaging manner. We believe this will foster lively discussions and facilitate networking opportunities among participants.

All my best, and thanks so much for your participation,

Ryan Skraba (on behalf of the program committee)

[Countdown]: https://www.timeanddate.com/countdown/to?iso=20240112T2359&p0=1440