You can probably use the one from Nutch 1.12, it certainly got more updates. M.
-----Original message----- > From:Tom Chiverton <[email protected]> > Sent: Friday 14th October 2016 17:44 > To: [email protected] > Subject: Re: Nutch 2, Solr 5 - solrdedup causes ClassCastException: > > OK, so where is a known good one for Solr 5 ? Maybe Felix could post his > and we could put that in the Nutch distro ? > > Tom > > > On 14/10/16 16:39, Markus Jelsma wrote: > > Yes, that file is probably incredible old and never maintained. You can > > safely remove those options from the schema. > > M. > > > > > > > > -----Original message----- > >> From:Tom Chiverton <[email protected]> > >> Sent: Friday 14th October 2016 17:34 > >> To: [email protected] > >> Subject: Re: Nutch 2, Solr 5 - solrdedup causes ClassCastException: > >> > >> Hmm, that makes a certain sense. > >> > >> But then I seem to be getting errors like > >> > >> ERROR: Error CREATEing SolrCore 'nutch': Unable to create core [nutch] > >> Caused by: enablePositionIncrements is not a valid option as of Lucene 5.0 > >> > >> Like the Nutch schema is for a much older Solr ? > >> > >> What version were you running ? > >> > >> On 14/10/16 16:17, Felix von Zadow wrote: > >>> I had the same problem a while ago; I accidentally forgot to supply the > >>> schema when creating a core and had the digest/string*s* problem. Here's > >>> some more explanation of what I did, I hope I remember correctly: > >>> > >>> I am using (just like Markus suggested) the schema.xml from nutch which > >>> sets > >>> <fieldType name="string" class="solr.StrField" ... > >>> and > >>> <field name="digest" type="string" ... > >>> > >>> I duplicated the data_driven_schema_configs/conf/ configset and replaced > >>> the managed-schema file with Nutch's schema.xml so I have: > >>> [...]/solr/configsets/my_config/managed_schema > >>> > >>> Core is created like so: > >>> solr create -c corename -d my_config > >>> > >>> > >>> Hope that helps, > >>> Felix > >>> > >>>> -----Ursprüngliche Nachricht----- > >>>> Von: Markus Jelsma [mailto:[email protected]] > >>>> Gesendet: Freitag, 14. Oktober 2016 17:05 > >>>> An: [email protected] > >>>> Betreff: RE: Nutch 2, Solr 5 - solrdedup causes ClassCastException: > >>>> > >>>> Yes, current Solr comes with a schemaless mode, which can cause errors in > >>>> some cases. It must be configured to use classic schema mode, then you > >>>> provide > >>>> it with the schema.xml file you can find in Nutch' conf directory. > >>>> > >>>> M. > >>>> > >>>> > >>>> > >>>> -----Original message----- > >>>>> From:Tom Chiverton <[email protected]> > >>>>> Sent: Friday 14th October 2016 16:58 > >>>>> To: [email protected] > >>>>> Subject: Re: Nutch 2, Solr 5 - solrdedup causes ClassCastException: > >>>>> > >>>>> I don't understand what you mean here. I am not a Solr expert, though > >>>>> I've used it a bit in the past, though not with Nutch. > >>>>> > >>>>> Is there a schema I should be feeding it ? > >>>>> > >>>>> Tom > >>>>> > >>>>> > >>>>> On 14/10/16 15:50, Markus Jelsma wrote: > >>>>>> Solr supports schemaless mode, which may be your case. Perhaps it made > >>>> your digest field multi valued. I'd suggest to use Solr's classic schema > >>>> factory, > >>>> and a fixed schema. > >>> ______________________________________________________________________ > >>> This email has been scanned by the Symantec Email Security.cloud service. > >>> For more information please visit http://www.symanteccloud.com > >>> ______________________________________________________________________ > >>> > >> > > ______________________________________________________________________ > > This email has been scanned by the Symantec Email Security.cloud service. > > For more information please visit http://www.symanteccloud.com > > ______________________________________________________________________ > > > >

