[
https://issues.apache.org/jira/browse/NUTCH-3113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3113.
------------------------------------
Resolution: Implemented
> Group commands in bin/nutch command-line help
> ---------------------------------------------
>
> Key: NUTCH-3113
> URL: https://issues.apache.org/jira/browse/NUTCH-3113
> Project: Nutch
> Issue Type: Improvement
> Components: CLI
> Affects Versions: 1.20
> Reporter: Sebastian Nagel
> Assignee: Sebastian Nagel
> Priority: Major
> Fix For: 1.21
>
>
> The 38 commands in the command-line help of bin/nutch appear in a long,
> unstructured list. Grouping the commands thematically may help to find the
> appropriate command. A PR is under way, the output will look like:
> {noformat}
> (Crawl commands)
> inject inject new urls into the database
> generate generate new segments to fetch from crawl db
> fetch fetch a segment's pages
> parse parse a segment's pages
> updatedb update crawl db from segments after fetching
> (CrawlDb commands)
> readdb read / dump crawl db
> mergedb merge crawldb-s, with optional filtering
> dedup deduplicate entries in the crawldb and assign them a
> special status
> domainstats calculate domain statistics from crawldb
> protocolstats calculate protocol status code stats from crawldb
> crawlcomplete calculate crawl completion stats from crawldb
> (Segment tools)
> ...
> {noformat}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)