[
https://issues.apache.org/jira/browse/NUTCH-2670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16667107#comment-16667107
]
Yossi Tamari commented on NUTCH-2670:
-------------------------------------
This is reproducible. Values set in nutch-site.xml and nutch-default.xml are not
visible to IndexerMapReduce.
[~aquaticwater], to set this to true, you need to pass -deleteGone to the Index
command, which sets the value in the conf. See IndexingJob, around line 210.
I'm not sure this is a bug, though: nothing indicates that the value should be
read from the XML files.
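
One way this behaviour could arise is sketched below. It assumes the job setup
writes the -deleteGone flag into the conf unconditionally, so any value loaded
from the XML files is overwritten before the reducer reads it. That is an
assumption about IndexingJob, not a quote of its code. A plain Map stands in for
Hadoop's Configuration, and DeleteGoneSketch, getBoolean and buildJobConf are
hypothetical names used only for illustration.

```java
import java.util.HashMap;
import java.util.Map;

// Simplified stand-in for a Hadoop-style Configuration; NOT the Nutch code.
final class DeleteGoneSketch {
    static final String INDEXER_DELETE = "indexer.delete";

    // Loosely mimics a getBoolean lookup: the default applies only if the key is absent.
    static boolean getBoolean(Map<String, String> conf, String key, boolean defaultValue) {
        String value = conf.get(key);
        return value == null ? defaultValue : Boolean.parseBoolean(value.trim());
    }

    // Hypothetical job setup (assumption): the command-line flag is written
    // into the conf unconditionally, replacing whatever the XML files supplied.
    static Map<String, String> buildJobConf(Map<String, String> xmlValues, boolean deleteGoneFlag) {
        Map<String, String> conf = new HashMap<>(xmlValues);
        conf.put(INDEXER_DELETE, Boolean.toString(deleteGoneFlag));
        return conf;
    }

    public static void main(String[] args) {
        Map<String, String> xml = new HashMap<>();
        xml.put(INDEXER_DELETE, "true"); // as if set in nutch-site.xml

        Map<String, String> withoutFlag = buildJobConf(xml, false);
        Map<String, String> withFlag = buildJobConf(xml, true);

        // Without -deleteGone the XML value is overwritten with false.
        System.out.println(getBoolean(withoutFlag, INDEXER_DELETE, false));
        // Changing the default to true makes no difference: the key is present.
        System.out.println(getBoolean(withoutFlag, INDEXER_DELETE, true));
        // With -deleteGone the reducer would see true.
        System.out.println(getBoolean(withFlag, INDEXER_DELETE, false));
    }
}
```

Under that assumption, the default passed to getBoolean is never consulted,
which would also explain why changing it to true has no effect.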
> org.apache.nutch.indexer.IndexerMapReduce does not read the value of
> "indexer.delete" from nutch-site.xml
> ---------------------------------------------------------------------------------------------------------
>
> Key: NUTCH-2670
> URL: https://issues.apache.org/jira/browse/NUTCH-2670
> Project: Nutch
> Issue Type: Bug
> Components: indexer
> Affects Versions: 1.14, 1.15
> Environment: macOS Mojave and High Sierra
> MacBook Pro (Retina, 13-inch, Mid 2014)
> Oracle Java 1.8.0_144-b01 and previous versions
> Reporter: Junqiang Zhang
> Priority: Minor
>
> Inside org.apache.nutch.indexer.IndexerMapReduce.IndexerReducer, the setup()
> function should read the value of "indexer.delete" from nutch-site.xml and
> assign it to the variable "delete". See the following line of code:
> (line 201) delete = conf.getBoolean(INDEXER_DELETE, false);
> However, the value of "indexer.delete" set in nutch-site.xml and
> nutch-default.xml is not assigned to the variable "delete". I put the
> following setting in nutch-site.xml, in nutch-default.xml, and in both
> of them. In every case the variable "delete" remains false.
> <property>
>   <name>indexer.delete</name>
>   <value>true</value>
>   <description>Whether the indexer will delete documents GONE or REDIRECTS by
>   indexing filters
>   </description>
> </property>
> I also changed the line of code to
> delete = conf.getBoolean(INDEXER_DELETE, true);
> Whatever value of "indexer.delete" is set in nutch-site.xml or
> nutch-default.xml, the value of "delete" remains false.