Sebastian Nagel created NUTCH-3113:
--------------------------------------

             Summary: Group commands in bin/nutch command-line help
                 Key: NUTCH-3113
                 URL: https://issues.apache.org/jira/browse/NUTCH-3113
             Project: Nutch
          Issue Type: Improvement
          Components: CLI
    Affects Versions: 1.20
            Reporter: Sebastian Nagel
             Fix For: 1.21


The 38 commands in the command-line help of bin/nutch appear in a long, 
unstructured list. Grouping the commands thematically may help to find the 
appropriate command. A PR is under way, the output will look like:

{noformat}
 (Crawl commands)
  inject            inject new urls into the database
  generate          generate new segments to fetch from crawl db
  fetch             fetch a segment's pages
  parse             parse a segment's pages
  updatedb          update crawl db from segments after fetching

 (CrawlDb commands)
  readdb            read / dump crawl db
  mergedb           merge crawldb-s, with optional filtering
  dedup             deduplicate entries in the crawldb and assign them a 
special status
  domainstats       calculate domain statistics from crawldb
  protocolstats     calculate protocol status code stats from crawldb
  crawlcomplete     calculate crawl completion stats from crawldb

 (Segment tools)
   ...
{noformat}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to