[ https://issues.apache.org/jira/browse/NUTCH-2046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15142863#comment-15142863 ]
Julien Nioche commented on NUTCH-2046: -------------------------------------- I agree with the objective but I'd rather have a consistent approach and deal with that in the same way as we do for indexing i.e. [-s seedPath]. Shouldn't be difficult to do > The crawl script should be able to skip an initial injection. > ------------------------------------------------------------- > > Key: NUTCH-2046 > URL: https://issues.apache.org/jira/browse/NUTCH-2046 > Project: Nutch > Issue Type: Improvement > Components: crawldb, injector > Affects Versions: 1.10 > Reporter: Luis Lopez > Assignee: Lewis John McGibbney > Labels: crawl, injection > Fix For: 1.12 > > Attachments: crawl.patch > > > When our crawl gets really big a new injection takes considerable time as it > updates crawldb, the crawl script should be able to skip the injection and go > directly to the generate call. -- This message was sent by Atlassian JIRA (v6.3.4#6332)