[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input

2023-10-30 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17781088#comment-17781088
 ] 

ASF GitHub Bot commented on NUTCH-3017:
---

jnioche opened a new pull request, #793:
URL: https://github.com/apache/nutch/pull/793

   and support gzipped input
   
   See https://issues.apache.org/jira/browse/NUTCH-3017




> Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
> ---
>
> Key: NUTCH-3017
> URL: https://issues.apache.org/jira/browse/NUTCH-3017
> Project: Nutch
>  Issue Type: Improvement
>  Components: plugin, urlfilter
>Affects Versions: 1.19
>Reporter: Julien Nioche
>Priority: Minor
> Fix For: 1.20
>
>
> This provide an easier way to refresh the resources since no rebuild of the 
> jar will be needed. The path can point to either HDFS or S3. Additionally, 
> .gz files should be handled automatically



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input

2023-10-30 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17781080#comment-17781080
 ] 

ASF GitHub Bot commented on NUTCH-3017:
---

jnioche commented on PR #792:
URL: https://github.com/apache/nutch/pull/792#issuecomment-1785804884

   Obivously, pulled more changes than I meant to




> Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
> ---
>
> Key: NUTCH-3017
> URL: https://issues.apache.org/jira/browse/NUTCH-3017
> Project: Nutch
>  Issue Type: Improvement
>  Components: plugin, urlfilter
>Affects Versions: 1.19
>Reporter: Julien Nioche
>Priority: Minor
> Fix For: 1.20
>
>
> This provide an easier way to refresh the resources since no rebuild of the 
> jar will be needed. The path can point to either HDFS or S3. Additionally, 
> .gz files should be handled automatically



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: [PR] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input [NUTCH-3017] [nutch]

2023-10-30 Thread via GitHub


jnioche commented on PR #792:
URL: https://github.com/apache/nutch/pull/792#issuecomment-1785804884

   Obivously, pulled more changes than I meant to


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input

2023-10-30 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17781079#comment-17781079
 ] 

ASF GitHub Bot commented on NUTCH-3017:
---

jnioche closed pull request #792: Allow fast-urlfilter to load from HDFS/S3 and 
support gzipped input [NUTCH-3017]
URL: https://github.com/apache/nutch/pull/792




> Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
> ---
>
> Key: NUTCH-3017
> URL: https://issues.apache.org/jira/browse/NUTCH-3017
> Project: Nutch
>  Issue Type: Improvement
>  Components: plugin, urlfilter
>Affects Versions: 1.19
>Reporter: Julien Nioche
>Priority: Minor
> Fix For: 1.20
>
>
> This provide an easier way to refresh the resources since no rebuild of the 
> jar will be needed. The path can point to either HDFS or S3. Additionally, 
> .gz files should be handled automatically



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input

2023-10-30 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17781078#comment-17781078
 ] 

ASF GitHub Bot commented on NUTCH-3017:
---

jnioche opened a new pull request, #792:
URL: https://github.com/apache/nutch/pull/792

   See description in https://issues.apache.org/jira/browse/NUTCH-3017
   




> Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
> ---
>
> Key: NUTCH-3017
> URL: https://issues.apache.org/jira/browse/NUTCH-3017
> Project: Nutch
>  Issue Type: Improvement
>  Components: plugin, urlfilter
>Affects Versions: 1.19
>Reporter: Julien Nioche
>Priority: Minor
> Fix For: 1.20
>
>
> This provide an easier way to refresh the resources since no rebuild of the 
> jar will be needed. The path can point to either HDFS or S3. Additionally, 
> .gz files should be handled automatically



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: [PR] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input [NUTCH-3017] [nutch]

2023-10-30 Thread via GitHub


jnioche closed pull request #792: Allow fast-urlfilter to load from HDFS/S3 and 
support gzipped input [NUTCH-3017]
URL: https://github.com/apache/nutch/pull/792


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[PR] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input [NUTCH-3017] [nutch]

2023-10-30 Thread via GitHub


jnioche opened a new pull request, #792:
URL: https://github.com/apache/nutch/pull/792

   See description in https://issues.apache.org/jira/browse/NUTCH-3017
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[jira] [Updated] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input

2023-10-30 Thread Sebastian Nagel (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Nagel updated NUTCH-3017:
---
Component/s: plugin
 urlfilter

> Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
> ---
>
> Key: NUTCH-3017
> URL: https://issues.apache.org/jira/browse/NUTCH-3017
> Project: Nutch
>  Issue Type: Improvement
>  Components: plugin, urlfilter
>Affects Versions: 1.19
>Reporter: Julien Nioche
>Priority: Minor
> Fix For: 1.20
>
>
> This provide an easier way to refresh the resources since no rebuild of the 
> jar will be needed. The path can point to either HDFS or S3. Additionally, 
> .gz files should be handled automatically



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input

2023-10-30 Thread Sebastian Nagel (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Nagel updated NUTCH-3017:
---
Fix Version/s: 1.20

> Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
> ---
>
> Key: NUTCH-3017
> URL: https://issues.apache.org/jira/browse/NUTCH-3017
> Project: Nutch
>  Issue Type: Improvement
>Affects Versions: 1.19
>Reporter: Julien Nioche
>Priority: Minor
> Fix For: 1.20
>
>
> This provide an easier way to refresh the resources since no rebuild of the 
> jar will be needed. The path can point to either HDFS or S3. Additionally, 
> .gz files should be handled automatically



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (NUTCH-3017) Allow fast-urlfilter to load from HDFS/S3 and support gzipped input

2023-10-30 Thread Julien Nioche (Jira)
Julien Nioche created NUTCH-3017:


 Summary: Allow fast-urlfilter to load from HDFS/S3 and support 
gzipped input
 Key: NUTCH-3017
 URL: https://issues.apache.org/jira/browse/NUTCH-3017
 Project: Nutch
  Issue Type: Improvement
Affects Versions: 1.19
Reporter: Julien Nioche


This provide an easier way to refresh the resources since no rebuild of the jar 
will be needed. The path can point to either HDFS or S3. Additionally, .gz 
files should be handled automatically



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Call for Presentations now open: Community over Code EU 2024

2023-10-30 Thread Ryan Skraba
(Note: You are receiving this because you are subscribed to the dev@
list for one or more projects of the Apache Software Foundation.)

It's back *and* it's new!

We're excited to announce that the first edition of Community over
Code Europe (formerly known as ApacheCon EU) which will be held at the
Radisson Blu Carlton Hotel in Bratislava, Slovakia from June 03-05,
2024! This eagerly anticipated event will be our first live EU
conference since 2019.

The Call for Presentations (CFP) for Community Over Code EU 2024 is
now open at https://eu.communityovercode.org/blog/cfp-open/,
and will close 2024/01/12 23:59:59 GMT.

We welcome submissions on any topic related to the Apache Software
Foundation, Apache projects, or the communities around those projects.
We are specifically looking for presentations in the following
categories:

* API & Microservices
* Big Data Compute
* Big Data Storage
* Cassandra
* CloudStack
* Community
* Data Engineering
* Fintech
* Groovy
* Incubator
* IoT
* Performance Engineering
* Search
* Tomcat, Httpd and other servers

Additionally, we are thrilled to introduce a new feature this year: a
poster session. This addition will provide an excellent platform for
showcasing high-level projects and incubator initiatives in a visually
engaging manner. We believe this will foster lively discussions and
facilitate networking opportunities among participants.

All my best, and thanks so much for your participation,

Ryan Skraba (on behalf of the program committee)

[Countdown]: https://www.timeanddate.com/countdown/to?iso=20240112T2359&p0=1440