Sebastian Nagel created NUTCH-3113:
--------------------------------------

             Summary: Group commands in bin/nutch command-line help
                 Key: NUTCH-3113
                 URL: https://issues.apache.org/jira/browse/NUTCH-3113
             Project: Nutch
          Issue Type: Improvement
          Components: CLI
    Affects Versions: 1.20
            Reporter: Sebastian Nagel
             Fix For: 1.21

The 38 commands in the command-line help of bin/nutch appear in a long, unstructured list. Grouping the commands thematically may make it easier to find the appropriate command.

A PR is under way; the output will look like:
{noformat}
(Crawl commands)
  inject            inject new urls into the database
  generate          generate new segments to fetch from crawl db
  fetch             fetch a segment's pages
  parse             parse a segment's pages
  updatedb          update crawl db from segments after fetching

(CrawlDb commands)
  readdb            read / dump crawl db
  mergedb           merge crawldb-s, with optional filtering
  dedup             deduplicate entries in the crawldb and assign them a special status
  domainstats       calculate domain statistics from crawldb
  protocolstats     calculate protocol status code stats from crawldb
  crawlcomplete     calculate crawl completion stats from crawldb

(Segment tools)
...
{noformat}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)