Sebastian Nagel created NUTCH-3113:
--------------------------------------
Summary: Group commands in bin/nutch command-line help
Key: NUTCH-3113
URL: https://issues.apache.org/jira/browse/NUTCH-3113
Project: Nutch
Issue Type: Improvement
Components: CLI
Affects Versions: 1.20
Reporter: Sebastian Nagel
Fix For: 1.21
The 38 commands in the command-line help of bin/nutch appear in a long,
unstructured list. Grouping the commands thematically would make it easier to
find the appropriate command. A PR is under way; the output will look like:
{noformat}
(Crawl commands)
  inject         inject new urls into the database
  generate       generate new segments to fetch from crawl db
  fetch          fetch a segment's pages
  parse          parse a segment's pages
  updatedb       update crawl db from segments after fetching
(CrawlDb commands)
  readdb         read / dump crawl db
  mergedb        merge crawldb-s, with optional filtering
  dedup          deduplicate entries in the crawldb and assign them a
                 special status
  domainstats    calculate domain statistics from crawldb
  protocolstats  calculate protocol status code stats from crawldb
  crawlcomplete  calculate crawl completion stats from crawldb
(Segment tools)
  ...
{noformat}
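For illustration only, grouped help output of this kind could be produced in a bash script roughly as sketched below. This is not the code from the PR: the helper names print_group and print_usage, the column width and the indentation are assumptions made for the example, and only the first two groups from the listing above are included.

```shell
#!/usr/bin/env bash
# Illustrative sketch only; not the actual bin/nutch implementation.

# print_group (hypothetical helper): prints a "(heading)" line, then one
# aligned "command  description" row for each following pair of arguments.
print_group() {
  printf '(%s)\n' "$1"
  shift
  while [ "$#" -ge 2 ]; do
    # Pad the command name to 14 characters so descriptions line up.
    printf '  %-14s %s\n' "$1" "$2"
    shift 2
  done
}

# print_usage (hypothetical helper): emits the help text group by group.
print_usage() {
  print_group "Crawl commands" \
    inject   "inject new urls into the database" \
    generate "generate new segments to fetch from crawl db" \
    fetch    "fetch a segment's pages" \
    parse    "parse a segment's pages" \
    updatedb "update crawl db from segments after fetching"
  print_group "CrawlDb commands" \
    readdb   "read / dump crawl db" \
    mergedb  "merge crawldb-s, with optional filtering"
}

print_usage
```

Keeping each group in a single print_group call means a command can be moved to another group by moving one line, and the padding is defined in one place.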
--
This message was sent by Atlassian Jira
(v8.20.10#820010)