I imagine there is an easy answer to this, but I just can't find it -- any
pointers appreciated.
I am using Nutch 1.6 and Solr 4.
When indexing documents that have been successfully crawled, I occasionally
hit a document that does not match the Solr schema. For example, a document
may have two address fields when the Solr schema.xml does not declare
address as multi-valued.
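The kind of declaration involved looks roughly like the sketch below. The
type name and the indexed/stored attributes are assumed for illustration and
are not copied from my schema.xml:

```xml
<!-- Illustrative schema.xml entry; type and indexed/stored values are assumptions -->
<field name="address" type="string" indexed="true" stored="true" multiValued="false"/>
```

With a single-valued field like this, Solr rejects a document that carries
two address values.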
Ideally, I would like this document to be skipped and an error written to
the log file, while indexing of the remaining parsed documents continues.
Instead, the whole job fails.
I have tried setting
<abortOnConfigurationError>${solr.abortOnConfigurationError:false}</abortOnConfigurationError>
in solrconfig.xml and restarting Tomcat, but that does not seem to make a
difference.
Where else should I be looking?