[
https://issues.apache.org/jira/browse/NUTCH-2218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Joyce resolved NUTCH-2218.
----------------------------------
Resolution: Fixed
[~lewismc], This got merged. I added an example to the option you raised as
well. If that doesn't address your concerns let me know and I'll update in
another ticket.
{code}
| -> ./bin/nutch crawlcomplete
usage: CrawlCompletionStats [-h] -inputDirs <inputDirs> -mode <mode>
[-numReducers <numReducers>] -outputDir <outputDir>
-h,--help Show this message
-inputDirs <inputDirs> Comma separated list of crawl directories
(e.g., "./crawl1,./crawl2")
-mode <mode> Set statistics gathering mode (by 'host' or
by 'domain')
-numReducers <numReducers> Optional number of reduce jobs to use.
Defaults to 1
-outputDir <outputDir> Output directory where results should be
dumped
{code}
> Switch CrawlCompletion arg parsing to Commons CLI
> -------------------------------------------------
>
> Key: NUTCH-2218
> URL: https://issues.apache.org/jira/browse/NUTCH-2218
> Project: Nutch
> Issue Type: Improvement
> Components: util
> Affects Versions: 1.11
> Reporter: Michael Joyce
> Assignee: Michael Joyce
> Priority: Minor
> Fix For: 1.12
>
>
> The current CrawlCompletion utility should be updated to use commons CLI
> instead of doing manual arg parsing and checking.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)