[ https://issues.apache.org/jira/browse/NUTCH-3113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel reassigned NUTCH-3113:
--------------------------------------

    Assignee: Sebastian Nagel

> Group commands in bin/nutch command-line help
> ---------------------------------------------
>
>                 Key: NUTCH-3113
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3113
>             Project: Nutch
>          Issue Type: Improvement
>          Components: CLI
>    Affects Versions: 1.20
>            Reporter: Sebastian Nagel
>            Assignee: Sebastian Nagel
>            Priority: Major
>             Fix For: 1.21
>
>
> The 38 commands in the command-line help of bin/nutch appear in a long,
> unstructured list. Grouping the commands thematically may make it easier
> to find the appropriate command. A PR is under way; the output will look like:
> {noformat}
>  (Crawl commands)
>   inject          inject new urls into the database
>   generate        generate new segments to fetch from crawl db
>   fetch           fetch a segment's pages
>   parse           parse a segment's pages
>   updatedb        update crawl db from segments after fetching
>  (CrawlDb commands)
>   readdb          read / dump crawl db
>   mergedb         merge crawldb-s, with optional filtering
>   dedup           deduplicate entries in the crawldb and assign them a
>                   special status
>   domainstats     calculate domain statistics from crawldb
>   protocolstats   calculate protocol status code stats from crawldb
>   crawlcomplete   calculate crawl completion stats from crawldb
>  (Segment tools)
>   ...
> {noformat}

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
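
For illustration only, the grouped layout described in the issue could be produced by a small shell helper along the following lines. This is a minimal sketch, not the code from the pending PR: the function names `print_cmd` and `print_grouped_usage` and the column width of 15 are assumptions made for this example, and only a few of the commands quoted above are included.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of a grouped help listing; NOT the actual bin/nutch code.

# Print one help entry: the command name left-aligned in a fixed-width
# column (width 15 is an assumption), followed by its description.
print_cmd() {
  printf '  %-15s %s\n' "$1" "$2"
}

# Print a group heading in the "(Group name)" style shown in the issue.
print_group() {
  printf ' (%s)\n' "$1"
}

# Emit the help text group by group instead of as one flat list.
print_grouped_usage() {
  print_group "Crawl commands"
  print_cmd inject   "inject new urls into the database"
  print_cmd generate "generate new segments to fetch from crawl db"
  print_cmd fetch    "fetch a segment's pages"

  print_group "CrawlDb commands"
  print_cmd readdb   "read / dump crawl db"
  print_cmd mergedb  "merge crawldb-s, with optional filtering"
}

print_grouped_usage
```

Keeping each entry as a single `print_cmd` call means a command can be moved between groups by moving one line, which may be convenient while the grouping is still under discussion.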