[
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17784047#comment-17784047
]
Hudson commented on NUTCH-3017:
-------------------------------
SUCCESS: Integrated in Jenkins build Nutch ยป Nutch-trunk #141 (See
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/141/])
[NUTCH-3017] Allow fast-urlfilter to load from HDFS/S3 and support gzipped
input (julien:
[https://github.com/apache/nutch/commit/d1025fd634e79f2f384131ca2776f346aa446902])
* (edit)
src/plugin/urlfilter-fast/src/java/org/apache/nutch/urlfilter/fast/FastURLFilter.java
[NUTCH-3017] Allow fast-urlfilter to load from HDFS/S3 and support gzipped
input (snagel:
[https://github.com/apache/nutch/commit/ac383fc5125b6c114a23ef996558ead57e873970])
* (edit)
src/plugin/urlfilter-fast/src/java/org/apache/nutch/urlfilter/fast/FastURLFilter.java
* (edit) conf/nutch-default.xml
> Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
> -------------------------------------------------------------------
>
> Key: NUTCH-3017
> URL: https://issues.apache.org/jira/browse/NUTCH-3017
> Project: Nutch
> Issue Type: Improvement
> Components: plugin, urlfilter
> Affects Versions: 1.19
> Reporter: Julien Nioche
> Priority: Minor
> Fix For: 1.20
>
>
> This provide an easier way to refresh the resources since no rebuild of the
> jar will be needed. The path can point to either HDFS or S3. Additionally,
> .gz files should be handled automatically
--
This message was sent by Atlassian Jira
(v8.20.10#820010)