[ https://issues.apache.org/jira/browse/NUTCH-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel updated NUTCH-2495: ----------------------------------- Patch Info: Patch Available > Use -deleteGone instead of clean job in crawler script while indexing > --------------------------------------------------------------------- > > Key: NUTCH-2495 > URL: https://issues.apache.org/jira/browse/NUTCH-2495 > Project: Nutch > Issue Type: Improvement > Components: bin > Affects Versions: 1.15 > Reporter: Moreno Feltscher > Assignee: Lewis John McGibbney > Priority: Major > Fix For: 1.17 > > > Instead of running {{bin/nutch clean}} after indexing the documents run > {{bin/nutch index}} with the {{-deleteGone}} flag which instead of just > deleting gone and duplicated documents also deletes redirects from the index. -- This message was sent by Atlassian Jira (v8.3.4#803005)