I ended up building my own 'nutch' configset - the managed-schema is the only tricky thing to get right.

Is it worth removing the schema.xml from Nutch (as it doesn't work with Solr 5 or 6) and replacing with a suggested config set to use instead ?

Tom


On 20/10/16 08:35, lewis john mcgibbney wrote:
Hi Tom,
This looks like it has been frustrating for you so I've provided a walk
through of how I can set up a core using current Nutch 2.X schema.xml

On Mon, Oct 17, 2016 at 9:27 AM, <[email protected]> wrote:

From: Tom Chiverton <[email protected]>
To: [email protected]
Cc:
Date: Mon, 17 Oct 2016 09:55:53 +0100
Subject: Re: Nutch 2, Solr 5 - solrdedup causes ClassCastException:
I tried that, and it still gives

ERROR: Error CREATEing SolrCore 'nutch': Unable to create core [nutch]
Caused by: enablePositionIncrements is not a valid option as of Lucene 5.0

Tom


lmcgibbn@LMC-056430 /usr/local/solr-6.2.1 $ cp
/usr/local/nutch2/conf/schema.xml example/files/conf/
lmcgibbn@LMC-056430 /usr/local/solr-6.2.1 $ ./bin/solr start
Waiting up to 30 seconds to see Solr running on port 8983 [/]
Started Solr server on port 8983 (pid=49222). Happy searching!

lmcgibbn@LMC-056430 /usr/local/solr-6.2.1 $ ./bin/solr create -c nutch -d
/usr/local/solr-6.2.1/example/files/conf -p 8983

Copying configuration to new core instance directory:
/usr/local/solr-6.2.1/server/solr/nutch

Creating new core 'nutch' using command:
http://localhost:8983/solr/admin/cores?action=CREATE&name=nutch&instanceDir=nutch

{
   "responseHeader":{
     "status":0,
     "QTime":1657},
   "core":"nutch"}

I can now run my crawls on Nutch 2.X. Can you please replicate the above
then tell me where and if anything goes wrong?
Thanks
Lewis


______________________________________________________________________
This email has been scanned by the Symantec Email Security.cloud service.
For more information please visit http://www.symanteccloud.com
______________________________________________________________________

Reply via email to