[ https://issues.apache.org/jira/browse/NUTCH-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16364079#comment-16364079 ]
Semyon Semyonov commented on NUTCH-2510: ---------------------------------------- I have provided the pull request. There are two indicator flags for the script: 1) To update hostdb(but not use it in generate) put --hostdbupdate 2) To update hostdb and use it in generate use both --hostdbgenerate --hostdbupdate > Crawl script modification. HostDb : generate, optional usage and descirption > ---------------------------------------------------------------------------- > > Key: NUTCH-2510 > URL: https://issues.apache.org/jira/browse/NUTCH-2510 > Project: Nutch > Issue Type: Improvement > Components: bin > Affects Versions: 1.15 > Reporter: Semyon Semyonov > Priority: Minor > Fix For: 1.14 > > > Script crawl now includes hostdb update as a part of crawling cycle, but : > 1) There is no hostdb parameter for generate > 2) Generation of hostdb is not optional, therefore hostdb is generated each > step without asking of user. It should be an optional parameter. > 3) Description of 1 and 2. -- This message was sent by Atlassian JIRA (v7.6.3#76005)