Hi André, Yes, my schema.xml has field definitions for 5 new fields I index using Nutch urlmeta. So I definitely need to make sure it's being read.
It looks like I'll need to scrap this Solr core and build a new one. I had created this one using: /opt/solr/bin/solr create_core -c phfaws -d basic_configs ...and that got me a managed_schema rather than a schema.xml. Is there a way to build a core that will definitely use schema.xml? Chip ________________________________________ From: André Schild [[email protected]] Sent: Saturday, February 04, 2017 3:26 AM To: [email protected] Subject: AW: Indexing urlmeta fields into Solr 5.5.3 (Was RE: Failing to index from Nutch 1.12 to Solr 5.5.3) Hello Chip, >We've found that the solrindex process chokes on the custom metadata fields I >added to my Nutch using the urlmeta plugin. A sample of the lengthy error >>messages: > >java.lang.Exception: >org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error >from server at http://localhost:8983/solr/phfaws: ERROR: >>[doc=http://academics.wellesley.edu/lts/archives/3/3L_Astronomy.html] unknown >field 'icosreposurl' > at > org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462) > >As mentioned in my previous message, I've copied my Nutch schema.xml into my >Solr's conf folder, but since my Solr instance hadn't already had a >>schema.xml file I'm not convinced it's being read.. How do I set up my Solr >to take these new fields? Does that schema.xml file contains a definition for a field named "icosreposurl"? If not, then you have to add it. The example schema.xml does not handle all cases possible with nutch. André

