[ 
https://issues.apache.org/jira/browse/NUTCH-3113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Nagel reassigned NUTCH-3113:
--------------------------------------

    Assignee: Sebastian Nagel

> Group commands in bin/nutch command-line help
> ---------------------------------------------
>
>                 Key: NUTCH-3113
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3113
>             Project: Nutch
>          Issue Type: Improvement
>          Components: CLI
>    Affects Versions: 1.20
>            Reporter: Sebastian Nagel
>            Assignee: Sebastian Nagel
>            Priority: Major
>             Fix For: 1.21
>
>
> The 38 commands in the command-line help of bin/nutch appear in a long, 
> unstructured list. Grouping the commands thematically may help to find the 
> appropriate command. A PR is under way, the output will look like:
> {noformat}
>  (Crawl commands)
>   inject            inject new urls into the database
>   generate          generate new segments to fetch from crawl db
>   fetch             fetch a segment's pages
>   parse             parse a segment's pages
>   updatedb          update crawl db from segments after fetching
>  (CrawlDb commands)
>   readdb            read / dump crawl db
>   mergedb           merge crawldb-s, with optional filtering
>   dedup             deduplicate entries in the crawldb and assign them a 
> special status
>   domainstats       calculate domain statistics from crawldb
>   protocolstats     calculate protocol status code stats from crawldb
>   crawlcomplete     calculate crawl completion stats from crawldb
>  (Segment tools)
>    ...
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to