Michael Joyce created NUTCH-1987:
------------------------------------
Summary: Make bin/crawl indexer agnostic
Key: NUTCH-1987
URL: https://issues.apache.org/jira/browse/NUTCH-1987
Project: Nutch
Issue Type: Improvement
Affects Versions: 1.9
Reporter: Michael Joyce
Fix For: 1.10
The crawl script makes it a bit challenging to use an indexer that isn't Solr.
For instance, when I want to use the indexer-elastic plugin I still need to
call the crawl script with a fake Solr URL; otherwise it skips the
indexing step altogether.
{code}
bin/crawl urls/ crawl/ "http://fakeurl.com:9200" 1
{code}
It would be nice to keep configuration for the Solr indexer in the conf files
(to mirror the Elasticsearch indexer conf and others) and to make the indexing
parameter simply toggle whether indexing occurs, instead of also
trying to configure the indexer at the same time.
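
As a rough illustration of the toggle idea, here is a minimal sketch. It is not taken from the actual bin/crawl script: the function names should_index and index_step, and the accepted values (true, yes, 1), are assumptions made up for this example.

```shell
#!/usr/bin/env bash
# Hypothetical sketch only; none of these names exist in bin/crawl.

# Succeed (exit 0) when the toggle value asks for indexing, fail otherwise.
should_index() {
  case "$1" in
    true|yes|1) return 0 ;;
    *) return 1 ;;
  esac
}

# Print what the script would do for a given toggle value.
# The indexer itself (Solr, Elasticsearch, ...) is assumed to be
# configured in the conf files rather than on the command line.
index_step() {
  if should_index "$1"; then
    echo "indexing: indexer settings read from conf files"
  else
    echo "skipping indexing"
  fi
}
```

With something like this, the third argument to the script would only say whether to index, and no placeholder URL would be needed.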