You can create a core manually in the file system, in a specific place where
solr looks for cores when it starts up. I have mine in /opt/solr/server/sol. It
at least works in solr 5.4.1 (I haven't tried others).
The core needs a conf dir and a properties file. The properties file should
contain a property that points to the actual data directory. The conf dir
contains schema.xml and a bunch of other files. So, for a core named "popular",
I have
/opt/solr/server/solr popular core.properties conf
schema.xml (other files, including stopwords.txt)
popular_data (if initially empty, solr creates subdirectories here)
You may find more information by googling <solr core.properties instance
directory>
From: Chip Calhoun <[email protected]>
To: "[email protected]" <[email protected]>
Sent: Monday, February 6, 2017 7:10 AM
Subject: RE: Indexing urlmeta fields into Solr 5.5.3 (Was RE: Failing to index
from Nutch 1.12 to Solr 5.5.3)
Hi André,
Yes, my schema.xml has field definitions for 5 new fields I index using Nutch
urlmeta. So I definitely need to make sure it's being read.
It looks like I'll need to scrap this Solr core and build a new one. I had
created this one using:
/opt/solr/bin/solr create_core -c phfaws -d basic_configs
...and that got me a managed_schema rather than a schema.xml. Is there a way to
build a core that will definitely use schema.xml?
Chip
________________________________________
From: André Schild [[email protected]]
Sent: Saturday, February 04, 2017 3:26 AM
To: [email protected]
Subject: AW: Indexing urlmeta fields into Solr 5.5.3 (Was RE: Failing to index
from Nutch 1.12 to Solr 5.5.3)
Hello Chip,
>We've found that the solrindex process chokes on the custom metadata fields I
>added to my Nutch using the urlmeta plugin. A sample of the lengthy error
>>messages:
>
>java.lang.Exception:
>org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error
>from server at http://localhost:8983/solr/phfaws: ERROR:
>>[doc=http://academics.wellesley.edu/lts/archives/3/3L_Astronomy.html] unknown
>field 'icosreposurl'
> at
>org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462)
>
>As mentioned in my previous message, I've copied my Nutch schema.xml into my
>Solr's conf folder, but since my Solr instance hadn't already had a
>>schema.xml file I'm not convinced it's being read.. How do I set up my Solr
>to take these new fields?
Does that schema.xml file contains a definition for a field named
"icosreposurl"?
If not, then you have to add it. The example schema.xml does not handle all
cases possible with nutch.
André