[ 
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17784047#comment-17784047
 ] 

Hudson commented on NUTCH-3017:
-------------------------------

SUCCESS: Integrated in Jenkins build Nutch ยป Nutch-trunk #141 (See 
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/141/])
[NUTCH-3017] Allow fast-urlfilter to load from HDFS/S3 and support gzipped 
input (julien: 
[https://github.com/apache/nutch/commit/d1025fd634e79f2f384131ca2776f346aa446902])
* (edit) 
src/plugin/urlfilter-fast/src/java/org/apache/nutch/urlfilter/fast/FastURLFilter.java
[NUTCH-3017] Allow fast-urlfilter to load from HDFS/S3 and support gzipped 
input (snagel: 
[https://github.com/apache/nutch/commit/ac383fc5125b6c114a23ef996558ead57e873970])
* (edit) 
src/plugin/urlfilter-fast/src/java/org/apache/nutch/urlfilter/fast/FastURLFilter.java
* (edit) conf/nutch-default.xml


> Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
> -------------------------------------------------------------------
>
>                 Key: NUTCH-3017
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3017
>             Project: Nutch
>          Issue Type: Improvement
>          Components: plugin, urlfilter
>    Affects Versions: 1.19
>            Reporter: Julien Nioche
>            Priority: Minor
>             Fix For: 1.20
>
>
> This provide an easier way to refresh the resources since no rebuild of the 
> jar will be needed. The path can point to either HDFS or S3. Additionally, 
> .gz files should be handled automatically



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to