[ https://issues.apache.org/jira/browse/NUTCH-2670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16667105#comment-16667105 ]
Sebastian Nagel commented on NUTCH-2670:
----------------------------------------

The IndexingJob (`bin/nutch index`) overwrites the property "indexer.delete" with the value of the command-line option "-deleteGone". Setting the property in nutch-site.xml therefore has no effect.

> org.apache.nutch.indexer.IndexerMapReduce does not read the value of "indexer.delete" from nutch-site.xml
> ---------------------------------------------------------------------------------------------------------
>
>                 Key: NUTCH-2670
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2670
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 1.14, 1.15
>         Environment: macOS Mojave and High Sierra
> MacBook Pro (Retina, 13-inch, Mid 2014)
> Oracle Java 1.8.0_144-b01 and previous versions
>            Reporter: Junqiang Zhang
>            Priority: Minor
>
> Inside org.apache.nutch.indexer.IndexerMapReduce.IndexerReducer, the setup()
> function should read the value of "indexer.delete" from nutch-site.xml and
> assign it to the variable "delete". See the following line of code.
> (line 201) delete = conf.getBoolean(INDEXER_DELETE, false);
> However, the value of "indexer.delete" set in nutch-site.xml and
> nutch-default.xml is not assigned to the variable "delete". I put the
> following setting in nutch-site.xml, in nutch-default.xml, or in both
> of them. The variable "delete" remains false.
> <property>
>   <name>indexer.delete</name>
>   <value>true</value>
>   <description>Whether the indexer will delete documents GONE or REDIRECTS by
>   indexing filters
>   </description>
> </property>
> I also changed the line of code to
> delete = conf.getBoolean(INDEXER_DELETE, true);
> Whatever value of "indexer.delete" is set in nutch-site.xml or
> nutch-default.xml, the value of "delete" remains false.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
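
For illustration, the override described in the comment can be modeled roughly as follows. This is a simplified sketch, not the Nutch source: it uses `java.util.Properties` in place of the Hadoop configuration, and the class name `DeleteGoneOverrideSketch` and the methods `applyCommandLine` and `readDelete` are hypothetical stand-ins for the job driver and the reducer's setup(). It assumes the driver always writes "indexer.delete" from the presence or absence of "-deleteGone", which is what would explain why neither the XML value nor a changed default reaches the variable "delete".

```java
import java.util.Arrays;
import java.util.Properties;

// Simplified model of the behaviour described in the comment; not Nutch code.
class DeleteGoneOverrideSketch {

    static final String INDEXER_DELETE = "indexer.delete";

    // Hypothetical stand-in for the job driver: copies the site configuration,
    // then unconditionally sets "indexer.delete" from the command line.
    static Properties applyCommandLine(Properties siteConf, String[] args) {
        Properties jobConf = new Properties();
        jobConf.putAll(siteConf);
        boolean deleteGone = Arrays.asList(args).contains("-deleteGone");
        // Assumption: the flag value always replaces whatever the XML files set.
        jobConf.setProperty(INDEXER_DELETE, Boolean.toString(deleteGone));
        return jobConf;
    }

    // Hypothetical stand-in for the reducer's setup(): reads the property with
    // a default of false. The default is never used here because the driver
    // has already set the key explicitly.
    static boolean readDelete(Properties jobConf) {
        return Boolean.parseBoolean(jobConf.getProperty(INDEXER_DELETE, "false"));
    }

    public static void main(String[] args) {
        Properties siteConf = new Properties();
        siteConf.setProperty(INDEXER_DELETE, "true"); // as in nutch-site.xml

        // Without the flag, the site value is overwritten with false.
        System.out.println(readDelete(applyCommandLine(siteConf, new String[] {})));
        // With the flag, the value is true.
        System.out.println(readDelete(applyCommandLine(siteConf, new String[] {"-deleteGone"})));
    }
}
```

Under this model, the only way to get "delete" set to true is to pass "-deleteGone" to `bin/nutch index`.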