Julien Nioche created NUTCH-3017:
------------------------------------
Summary: Allow fast-urlfilter to load from HDFS/S3 and support
gzipped input
Key: NUTCH-3017
URL: https://issues.apache.org/jira/browse/NUTCH-3017
Project: Nutch
Issue Type: Improvement
Affects Versions: 1.19
Reporter: Julien Nioche
This provide an easier way to refresh the resources since no rebuild of the jar
will be needed. The path can point to either HDFS or S3. Additionally, .gz
files should be handled automatically
--
This message was sent by Atlassian Jira
(v8.20.10#820010)