Sebastian Nagel created NUTCH-2482:
--------------------------------------

             Summary: index-geoip not to add null values to document fields
                 Key: NUTCH-2482
                 URL: https://issues.apache.org/jira/browse/NUTCH-2482
             Project: Nutch
          Issue Type: Bug
          Components: indexer, plugin
    Affects Versions: 1.13
            Reporter: Sebastian Nagel
            Priority: Minor
             Fix For: 1.15


The plugin index-geoip may add null values to document fields which then cause 
further errors, here a NPE in IndexingFiltersChecker when toString() is called 
on null:
{noformat}
$ bin/nutch indexchecker -Dstore.ip.address=true 
-Dindex.geoip.usage=cityDatabase \
     -Dplugin.includes="protocol-http|parse-html|index-(basic|geoip)" 
http://www.example.com/
...
Exception in thread "main" java.lang.NullPointerException
        at 
org.apache.nutch.indexer.IndexingFiltersChecker.fetch(IndexingFiltersChecker.java:340)
        at 
org.apache.nutch.indexer.IndexingFiltersChecker.run(IndexingFiltersChecker.java:127)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
        at 
org.apache.nutch.indexer.IndexingFiltersChecker.main(IndexingFiltersChecker.java:370)
{noformat}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to