You can create a core manually in the file system, in a specific place where 
solr looks for cores when it starts up. I have mine in /opt/solr/server/sol. It 
at least works in solr 5.4.1 (I haven't tried others).

The core needs a conf dir and a properties file. The properties file should 
contain a property that points to the actual data directory. The conf dir 
contains schema.xml and a bunch of other files. So, for a core named "popular", 
I have
/opt/solr/server/solr    popular        core.properties        conf            
schema.xml            (other files, including stopwords.txt)        
popular_data            (if initially empty, solr creates subdirectories here)
You may find more information by googling <solr core.properties instance 
directory>

      From: Chip Calhoun <[email protected]>
 To: "[email protected]" <[email protected]> 
 Sent: Monday, February 6, 2017 7:10 AM
 Subject: RE: Indexing urlmeta fields into Solr 5.5.3 (Was RE: Failing to index 
from Nutch 1.12 to Solr 5.5.3)
   
Hi André,

Yes, my schema.xml has field definitions for 5 new fields I index using Nutch 
urlmeta. So I definitely need to make sure it's being read. 

It looks like I'll need to scrap this Solr core and build a new one. I had 
created this one using:
/opt/solr/bin/solr create_core -c phfaws -d basic_configs
...and that got me a managed_schema rather than a schema.xml. Is there a way to 
build a core that will definitely use schema.xml?

Chip


________________________________________
From: André Schild [[email protected]]
Sent: Saturday, February 04, 2017 3:26 AM
To: [email protected]
Subject: AW: Indexing urlmeta fields into Solr 5.5.3 (Was RE: Failing to index 
from Nutch 1.12 to Solr 5.5.3)

Hello Chip,

>We've found that the solrindex process chokes on the custom metadata fields I 
>added to my Nutch using the urlmeta plugin. A sample of the lengthy error 
>>messages:
>
>java.lang.Exception: 
>org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error 
>from server at http://localhost:8983/solr/phfaws: ERROR: 
>>[doc=http://academics.wellesley.edu/lts/archives/3/3L_Astronomy.html] unknown 
>field 'icosreposurl'
>    at 
>org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462)
>
>As mentioned in my previous message, I've copied my Nutch schema.xml into my 
>Solr's conf folder, but since my Solr instance hadn't already had a 
>>schema.xml file I'm not convinced it's being read.. How do I set up my Solr 
>to take these new fields?

Does that schema.xml file contains a definition for a field named 
"icosreposurl"?
If not, then you have to add it. The example schema.xml does not handle all 
cases possible with nutch.

André




   

Reply via email to