Semyon Semyonov created NUTCH-2510:
--------------------------------------

             Summary: Crawl script modification. HostDb : generate, optional 
usage and descirption
                 Key: NUTCH-2510
                 URL: https://issues.apache.org/jira/browse/NUTCH-2510
             Project: Nutch
          Issue Type: Improvement
          Components: bin
    Affects Versions: 1.15
            Reporter: Semyon Semyonov
             Fix For: 1.14


Script crawl now includes hostdb update as a part of crawling cycle, but :
1) There is no hostdb parameter for generate

2) Generation of hostdb is not optional, therefore hostdb is generated each 
step without asking of user. It should be an optional parameter.

3) Description of 1 and 2.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to