Sebastian Nagel created NUTCH-2482:
--------------------------------------
Summary: index-geoip not to add null values to document fields
Key: NUTCH-2482
URL: https://issues.apache.org/jira/browse/NUTCH-2482
Project: Nutch
Issue Type: Bug
Components: indexer, plugin
Affects Versions: 1.13
Reporter: Sebastian Nagel
Priority: Minor
Fix For: 1.15
The plugin index-geoip may add null values to document fields which then cause
further errors, here a NPE in IndexingFiltersChecker when toString() is called
on null:
{noformat}
$ bin/nutch indexchecker -Dstore.ip.address=true
-Dindex.geoip.usage=cityDatabase \
-Dplugin.includes="protocol-http|parse-html|index-(basic|geoip)"
http://www.example.com/
...
Exception in thread "main" java.lang.NullPointerException
at
org.apache.nutch.indexer.IndexingFiltersChecker.fetch(IndexingFiltersChecker.java:340)
at
org.apache.nutch.indexer.IndexingFiltersChecker.run(IndexingFiltersChecker.java:127)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at
org.apache.nutch.indexer.IndexingFiltersChecker.main(IndexingFiltersChecker.java:370)
{noformat}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)