Giuseppe Totaro created NUTCH-1998:
--------------------------------------
Summary: Add support for user-defined file extension to
CommonCrawlDataDumper
Key: NUTCH-1998
URL: https://issues.apache.org/jira/browse/NUTCH-1998
Project: Nutch
Issue Type: Improvement
Components: tool
Reporter: Giuseppe Totaro
Priority: Minor
{{CommonCrawlDataDumper}} tool is able to generate CBOR-encoded files,
extracted from Nutch crawled data, using the Common Crawl format. By default,
{{CommonCrawlDataDumper}} uses the original file extension.
We are going to add support for a command-line option (e.g., {{-extension}})
that allows the user to provide a file extension to use in place of the
original one.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)